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In order to ascertain the degree of musical ability in all first-year 
boys it was decided to carry out tests using material derived from 
similar investigations by Dr. C. H. Wing of London (16). By 
applying this material to two carefully selected groups it was 
hoped, inter alia, to evolve a shorter battery of tests which could 
be used to ascertain musical ability. 

While seeking a general approach to the question the author 
consulted past work in the field of musical ability, and on reading 
James L. Mursell's The Psychology of Music, came across a state- 
ment that formed the basic approach to his problem: “We must 
try,” wrote Mursell, “our developed tests upon individuals known 
to be conspicuously musical and those known to be conspicuously 
nonmusical to try to discover where the most crucial and significant 
performances are located. Strange to say,” he added, “this obvious 
procedure has only been adopted by Stumf and by Ravecz—and 
in each case only a single individual subject was involved” (10). 
Dr. Wing used a battery of seven tests which included tests of 
perceptual efficiency and tests of musical preference. In some of 
his tests Wing’s approach was that of the Gestalt psychologists—a 
school of psychology which emphasized the perception of patterns, 
shapes, and tunes as wholes. The primary object of Wing’s research 
was “to examine the hypothesis of a general factor for musical 
ability, and to explore the influence of more specialized factors 
such as might obscure this general factor or interfere with its 
measurement.” Wing found that there was no great difference as 
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regards the efficiency of the individual tests, but found the phrasing 
test which had a saturation coefficient of nearly 0.80 the best. 

In order to find the most significant differences in performance 
it was proposed to follow up Mursell’s premise and to try Wing's 
tests upon individuals known to be conspicuously musical and 
those known to be conspicuously nonmusical. From this it was 
hoped to reduce Wing's battery of seven tests to a more economical 
number—if possible to two or three. Most important, then, was 
the selection of the two groups. Mursell's use of the phrase **con- 
spicuously musical" made it imperative to select only those pupils 
who had proved themselves both in the theory and practice of 
music. Ages in this group ranged from thirteen years eleven months 
to eighteen years six months, the mean age being sixteen years 
three months. My idea was to select pupils from the sixteen to 
eighteen age group, for by doing so I was certain of obtaining 
only those who had advanced music certificates. However, four 
pupils under fifteen years of age were included in the group. I 
knew two of these pupils to be conspicuously musical, and informa- 
tion concerning the other two was given by their respective college 
music directors. Two pupils possessed Associate Diplomas of 
recognized music colleges, while two possessed the Licentiate. 
The remainder of the group had reached the higher grades and 
had gained either honours or merit in their theory and practical 
examinations. In all but four cases there was musical activity 
in the home. All twenty-four of the musical group were active 
participants in the musical life of their respective schools—orches- 
tras, bands, choral clubs, and so on. Five of the group had shown 
promise in the field of composition and had actually had their 
Works performed in public. 

The second group was more difficult to select. To follow Mursell's 
premise faithfully this group had to be conspicuously nonmusical. 
Selection was based mainly on: first, complete dislike for music, 
and second, lack of musicality in parents, brothers, and sisters. 
For a final selection I used Wing’s questionnaire. Subjects were 
asked to place a tick against the word that most nearly described 
their general attitude towards music: (a) very interested; (b) 
interested; (c) indifferent; (d) dislike. 

They could add a plus or minus sign if they desired finer shades 
than those given. Secondly, the following was given to elicit infor- 
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mation about their own musical activities and those of their par- 
ents, brothers, and sisters: Does anyone at home play an instru- 
ment? If so, state who they are, what they play, and roughly how 
much, e.g., "father, oboe, frequently." They were asked to use 
terms “frequently,” "occasionally," or "seldom" to describe the 
amount of playing activity. For my final nonmusical group I 
selected those pupils who: (a) placed a tick against dislike; (b) re- 
corded a “nil” answer to musical activity at home; (c) did not 
play an instrument and were not interested in learning to do so; 
(d) were not keen about school musical activities such as school 
singing (this was an individual question). Ages in this group ranged 
from fifteen years to seventeen years eleven months, the mean 
age being sixteen years six months. 

Besides these two main groups I included a group of three pupils 
who, having had some years of music instruction, had given up 
because of lack of progress. I thought that, although a very small 
group, they might provide some further evidence to aid final 
analysis. For the experiment I would have liked a very much 
larger group, but the fact that testing had to be done out of school 
hours made it most difficult to collect a group enthusiastic enough 
to take the tests. As far as the musical group was concerned, 
directors of music of the secondary schools concerned sent only 
their best pupils. I considered this to be an excellent group. 

To ascertain ability to appreciate certain aspects of musical 
material, the complete battery of Wing’s tests was administered. 
This included chord analysis, pitch change, memory tests, rhythmic 
accent, harmony, intensity, and phrasing. While making prelimi- 
vents for the investigation, approach had been made 
isic teachers and others interested in music and 
their opinions sought on the nature of musical ability. Quite a 
number were emphatic about the possession of good intelligence. 
However, one particular teacher of pianoforte made this reply, 
“Tt is all a question of head and hand.” Whether manual dexterity 
would prove to be a factor or not prompted me to include the 
Minnesota Rate of Manipulation Test. The first two individual 
tests were used—placing, and turning and placing. The first test 
involved the use of one hand, the second, two hands. To test 


intelligence Raven’s Progressive Matrices, 1947, was used. 
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ANALYSIS OF THE DATA 


Although sixty pupils were tested two g 
four pupils were selected for final diagnosis 
ascertain significant differences betwee: 
two groups. The critical ratio proved t 
ratios were obtained between the two 


roups cach of twenty- 
- The first task was to 
n the musical scores of the 
0 be 10.5. Similarly, critical 
groups on all musical tests: 


Test 1 7.02 
Test 2 8.78 
Test 3 6.50 
Test 4 4.27 
"Test 5 11.25 
Test 6 4.64 
Test 7 3.42 


At the 0.01 level all the ratios were highly significant. The two 
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TABLE I—ConuELATION MATRIX AND First Factor SOLUTION 
(Nonmusic group) 


Test 1 2 3 4 5 6 7 Raven Z 
1 0.31 0.28 0.10 0.31 0.24 0.10 0.27 0.21 1.82 
2 0.28 0.63 0.34 0.51 0.02 0.00 0.50 0.63 2.91 
3 0.10 0.34 0.46 0.13 0.14 0.46 0.27 0.32 2.22 
4 0.31 0.51 0.13 0.51 0.08 0.15 0.25 0.20 2.14 
5 10.24 0.02 0.14 0.08 0.28 0.00 0.28 0.17 1.21 
6 0.10 0.00 0.46 0.15 0.00 0.46 0.22 0.00 1.29 
7 0.27 0.50 0.27 0.25 0.28 0.22 0.50 0.40 2.69 
Raven 0.21 0.63 0.32 0.20 0.17 0.00 0.40 0.63 2.56 
| 
x n 2.91 2.22 2.14 1.21 1.39 2.69 2.56 16.94 


First factor loud 
ings pu 0.71 0.54 0.52 0.29 0.34 0.05 0.62 


The mean communality of the first factor of this group was 0.282. 


not only the ability to educe relations but also correlates. This is, 
of course, the idea underlying the matrices test. Test 3 had a 
saturation of 0.54. The test involved detection of a change of 
note in a short melodie phrase—a test which demanded retentivity. 

It is not often an easy matter to listen first to a melodie phrase 
of a given number of notes, and then when the phrase is repeated 
to say which note is changed. However, the mere fact that test 2 
involved a relationship of two notes—one higher or lower than 
the given note—and test 7 differentiated between staccato and 
legato playing may simply mean that both tests were measuring 
sensitivity. Whatever the factor operating through the perform- 
ances of the nonmusical group, it would appear to be a significant 
element in the Raven test performance. 

There appeared to be a factor which was responsible for some 
forty-four per cent of the performance in the musical group (‘Table 
II). The Raven loading was not as high in this group as it was 
in the case of the nonmusical group. Something was therefore 
operating through the performances of the musical group that 
was over and above intelligence as measured by the matrices 
test. Test 4 (judging the better rhythmic accent) and test 5 
(judging the more appropriate of the two harmonizations) were 
the most outstanding performances. As all the first factor loadings 
were high, one may conclude a common factor running through 
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TABLE II—ConnELATION MATRIX AND First Factor SOLUTION 
(Musical group) 


Test 1 2 3 4 5 6 7 Raven EZ 
1 0.66 0.13 0.66 0.57 0.39 0.39 0.46 0.36 3.62 
2 0.13 0.54 0.17 0.54 0.47 0.37 0.44 0.18 2.84 
3 0.66 0.17 0.66 0.64 0.53 0.31 0.16 0.42 3.55 
4 0.57 0.54 0.64 0.65 0.65 0.46 0.40 0.40 4.31 
5 0.39 0.47 0.53 0.46 0.65 0.37 0.57 0.47 4.10 
6 0.39 0.37 0.31 0.40 0.37 0.52 0.41 0.52 3.35 
7 0.46 0.44 0.16 0.40 0.37 0.49 0.57 0.20 3.21 
Raven 0.36 0.18 0.42 0.40 0.47 0.52 0.20 0.52 3.07 
z 3.62 2.84 3.55 4.31 4.10 3.35 3.21 3.07 28.05 
First factor load- 
ings 0.68 0.54 0.67 0.81 0.77 0.63 0.60 0.58 


The mean communality of the first factor of this group was 0.445. 


all the tests. A residual matrix was obtained and McNemar’s 
criterion applied.! 

What, then, was this factor affecting the performances in all 
the tests? I hesitated to call it “musical ability.” The group was a 
homogeneous one, and the critical ratios between the two groups 
was so highly significant that I could but regard the factor as 
one of “musical experience"—experience plus an intellectual 
element. This intellectual element I would not confuse with Spear- 
man’s “g.” The experiential element would certainly account for 
the high saturation in test 5 (harmony), where ears trained in 
aural perception would easily detect those melodic lines in which 
chord progressions were obviously incorrect. The first factor 
loading in test 4 was 0.81. In this test Wing had included classical 
extracts which were obviously known by members of the musical 
group, for one had only to study their faces during the test perform- 
ance to realise that tunes were familiar. 


1 MeNemar's criterion: MeNemar isolated factors until G, reached or fell 
1 


below 4/N 
Where G, = G, + (1 — My?) 
Where G, = SD of residuals after s factors 
In the case of the residual Matrix for the musical group G; = .19 which fell 


below 0.20, i.e. (1/4/24). Thus only one factor could be isolated from 
the performances of the musical group. 
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And so, to attempt an answer to Mursell's premise I could 
only suggest that the differences in performance lay in the fact 
that, in the case of the musical group, “experience” played a 
most important part. 

It is worthy of note here that Wing obtained his three factors 
from an analysis of the results of his tests on a group of forty-three 
boys aged fourteen to sixteen. He chose this group because it 
was the largest one to which he had given the tests twice. His 
group was not, therefore, a homogeneous one. The mere fact 
that my nonmusical group lacked musical experience had, in all 
probability, accounted for low test scores. Worthy of note also 
was the fact that those subjects who took the tests but who were 
rejected because they had learned an instrument, or because of 
some form of musical expression in the home, scored a little higher 
than members of the nonmusical group. The critical ratios of the 
three girls who had had some three to four years of musical in- 
struction were also significant and marked them off as a group 
whose performances in the music tests were much better than 
those of the nonmusical group. 

The second part of the investigation involved the selection of a 
more economical battery of tests. Wing himself had stated that 
he did not object to a smaller number of tests. But to select such 
predictive tests from the original battery of seven it appeared 
that certain criteria should be satisfied. Tests should yield signifi- 
cant correlations with musical age in both the musical and the 
nonmusical groups; as the factor accounting for the best inter- 
correlations in the musical group appeared to involve an appre- 
ciative experiential element, the predictive tests should have 
comparatively low loadings in that factor; the intercorrelations 
among the predictive tests should be as low as possible. 

Tests 2, 3, and 7 appeared to satisfy these criteria very well: 
they yielded significant and relatively constant correlations with 
musical age in both groups and their intercorrelations were low. 
When regression weights, determined by the Aitken-Thomson 
method of pivotal condensation, were applied, these three tests 
yielded most significant multiple correlations. Multiple correlation 
with the musical group was 0.92 and with the nonmusical group 
0.90. Tests yielding very high correlations only in the musical 
group (tests 3, 4, and 5) when similarly weighted failed to give 
multiple correlations of any greater magnitude than the coefficients 
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yielded by those tests in the two groups. This suggested that 
those particular tests were highly influenced by the “experiential 
factor" isolated in the musical group. It appeared, then, that as 
tests for selection purposes, tests 2, 3, and 4 were greatly to be 
preferred and appeared to be promising predictive instruments. 

The manual dexterity tests failed to give any significant results. 
In the musical group the correlation of musical age and manual 
test 1 (placing) was 0.17, and with manual test 2 (turning and 
placing), 0.10. The correlations in the nonmusical group were 
lower still: 0.10, manual test 1, and —0.18 with manual test 2. 
No attempt was made to correlate the Raven percentiles with 
any of the manual dexterity tests. I had hoped that there would 
have been some significant correlation between musical age and 
manual test 2 which involved the use of both hands. Observance, 
however, indicated that facility at the piano in no way helped to 
turn and place the blocks quickly—it appeared to be even a hin- 
drance. A pianist’s hands and fingers are trained to a certain 
pattern, and I found that most of the good pianists of the musical 
group fumbled and dropped blocks a good deal. The only one who 
performed really well was a Chinese lad in the nonmusical group. 
As far as manual test 1 was concerned, I thought that the good 
violin players in the musical group might have performed well as 
this test called for dexterity in the bow hand. Such, however, was 
again not the case. It appeared, then, that manual dexterity could 
be ruled out as a factor to be considered as far as musical perform- 
ance as measured by the Wing series of test was concerned. 


SUMMARY 


To assess musical ability and appreciation in the light of Mur- 
sell’s premise and to select an economical battery of Wing’s musical 
tests the following were administered: a) The Wing tests of musical 
intelligence, b) The 1947 Raven progressive matrices intelligence 
test (Australian Council of Educational Research), c) The Min- 
nesota tests of manual dexterity (tests 1 and 2). 

Tests were given to two carefully selected groups—musical, 
and nonmusical. 

Factor analysis on the lines described by Cyril Burt and using 
the Thurstone centroid method of factor analysis was applied. The 


results seemed to indicate that the musical group was superior to 
the nonmusical group bec 


ause of musical experience, an experience 
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which accounted for some forty-four per cent of the performance. 
In the nonmusical group one was isolated which possibly utilized 
intelligence in a finer form to appreciate a “gestalt” quality. 

Using the Aitken-Thomson method of pivotal condensation 
three tests yielded significant multiple correlations. These were 
tests 2, 3, and 7. 

The correlation of performances in the musical tests with the 
matrices intelligence test was significant but not high. 

Tests of manual dexterity were not to be regarded as playing 
any part in the diagnosis of musical ability as far as the Wing 
tests were concerned. 

Further developments are possible using tests 2, 3 and 7 on 
untrained groups who wish to become musicians, with subsequent 
assessment of the predictive. value of these tests when reliable 
measures of musical progress are available. 
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THE EFFECT OF INDUCED TENSION DURING 
TRAINING ON VISUAL FORM RECOGNITION:: 


SYLVIA R. MAYERS 


Physical Research Laboratories, Boston University 


Visual form recognition plays a cental róle in many educational 
experiences ranging from the child’s early encounter with alpha- 
betical symbols to the adult’s identification of biological specimens 
or aircraft. Visual form learning, which underlies this important 
skill, is thereby an exceptionally fruitful area for research. Numer- 
ous factors in visual learning affect the efficiency of recognition. 
This study, however, deals only with the effect of induced muscular 
tension during learning upon subsequent recognition. 

Many empirical analyses and speculative accounts appear to 
support the proposition that induced muscular tension influences 
learning and that an optimal amount is facilitative. From this 
proposition two generalizations, which provide the rationale for 
this study, have been extended to the classroom learning situation. 
The first states that learning is facilitated by the arousal of a 
hypothetical condition of “attention” or tension which is both 
instigated and revealed by observable muscular activity, as for 
example in the maintenance of erect posture. Support for this 
view has been drawn from many laboratory studies (e.g., 1, 8, 4, 8) 
which have systematically induced tension by procedures such as 
weight lifting or dynamometer performance. Some of these ex- 
periments demonstrate facilitation, but many do not. These 
equivocal results have recently been partially clarified by a verbal 
learning study (2) which demonstrates that induced tension facili- 
tates performance on a verbal task but has no effect on learning. 
This finding, however, cannot be generalized to the learning of 


! This work, sponsored by the United States Air Force (Aerial Recon- 
naissance Laboratory, Wright Air Development Center), was performed 
by Boston University under Contract No. AF 33-038-15615. 

* Portions of this paper were presented at the 1955 meetings of the Amer- 
ican Psychological Association at San Francisco. 

$ This article reports a part of a dissertation presented to the Graduate 
School of Boston University in partial fulfillment of the requirements for 
the degree of Doctor of Philosophy. 
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Ficure 1. An example of a training form (left) and recognition test 
(right). 


TABLE I 
Presentation Order of Training Conditions 
Subjects 
1 2 3 4 5 6 
x]|9)|x|n|s|7i9s 
olgn] Z 
y | pvp | «| T | # | w | o 
p|x | T | * | & |] o | # 
x | T [| | w | o | w | D 
T | « | w | 9 | KK | D | M 
% | 3% | o | « | D | * | T 


seven ten-item, equated sets of training forms and recognition 
tests. Each test card contained one form identical to the training 
form and three similar ones. An example of training forms and 
tests is shown in Figure 1. 

Design. The design used required that each S learn one set of 
forms in each of the seven conditions: 0, 1$, 4, 1$ and 34 maxi- 
mum dynamometer tension; tracing (T); and drawing (D). The 
order of presentation of conditions was as in Table I. 

Procedure. The seven experimental sessions for each S took 
place within five days, not more than two a day at a minimum 
interval of three hours. At each session S performed on one set, of 
forms. In all conditions each of the ten items in a training set was 
presented for fifteen seconds and spaced at twenty seconds. A 
ten-minute rest was interpolated between training and testing. 
The ten items in a recognition test were presented for five seconds 
each and spaced by ten seconds. S was required to select from 
four possibilities that form which he had memorized during train- 
ing. The number of correct recognitions served as the primary 
response measure. 

At the beginning of every experimental session S's maximum 
dynamometer grip with his left hand was recorded at the end of 
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fifteen seconds of pressure. E then read instructions for the activity 
of the condition and emphasized the need for careful examination 
of each form in anticipation of the subsequent recognition test. 
Three practice forms were presented by the method for the session 
followed immediately by the recognition test. Correetion, addi- 
tional instructions and practice were provided so that S could 
achieve perfect performance on the three practice forms. 

A three-minute rest was allowed between practice and training. 
For the L6, 14, 14 and 24 tension conditions the maximum dynamo- 
meter reading of the session was appropriately fractionated and 
the adjustable tachistoscope switch was positioned on the dy- 
namometer dial at the resulting value. Each form was exposed in 
the tachistoscope when S exerted the required pressure on the 
dynamometer handle upon a signal from E. If S overshot or under- 
shot the range of three kilograms from the required point, the 
buzzer sounded until S corrected his grip, but the visual form 
remained. 

In the drawing condition S copied each form onto a card posi- 
tioned beside the tachistoscope aperture; in the tracing condition 
he traced each form onto a plexiglas slide placed over the tachisto- 
scope aperture; in the zero-induced tension condition he observed 
each form, his two arms resting on the tachistoscope stand. Expo- 
sures were controlled by E in the latter three conditions. To 
prevent S from viewing his completed drawing or tracing, a small 
shutter on the headrest was released at the end of each exposure. 
During the twenty-second interval E replaced the soiled cards or 
plexiglas slides with clean ones. The shutter was used in all con- 
ditions to maintain experimental uniformity. 


RESULTS 


Table II presents the mean number of correct responses and 
standard deviations (SDs) in the seven training conditions. In- 
spection of this table and Figure 2 shows that (a) increased dy- 
namometer-induced tension is associated with decreased recogni- 
tion proficiency, (b) drawing is associated with an increase in 
recognition proficiency over zero-induced tension, and (c) tracing 
is associated with a decrease in recognition proficiency from zero- 
induced tension. 

The analysis of variance of recognition scores with 0, 1g, 4, 4 
and 34 dynamometer induced tension reveals statistically signifi- 


——————— 
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TABLE II—RECOGNITION Scores For ALL TRAINING 
Conpitions (N 1N Eacu Conpition = 28) 


Training Condition* 


Measuret 
| 0 M M M x D T 
Mean 6.14 | 5.28 | 5.00 | 4.75 | 4.75 | 7.46 | 4.78 
Standard deviation 1.95 | 2.30 | 1.58 | 1.56 | 1.36 | 1.31 | 2.15 
*0, 1$, 14, M, 34 maximum dynamometer pressure; drawing (D); trac- 
ing (T). 
1 Recognition scores are based on number of correct responses to 10 
test items. 
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FicunE 2. The effect of induced tension during training on visual form 
recognition. 


cant differences. The F obtained for tension conditions is 3.97 for 
four and one hundred and eight df (p < 0.01), and the F for 
individuals is 3.31 for twenty-seven and one hundred and eight 
df (p < 0.001). 

Also statistically significant was the analysis of variance for 
differences between drawing, tracing, and zero-induced tension 
with an F for conditions of 24.50 for two and fifty-four df (p < 
0.001), and an F for individual differences of 2.97 for twenty-seven 
and fifty-four df (p < 0.001). Bartlett’s chi-square test indicated 
these analyses to be appropriate. 


DISCUSSION 


The two generalizations considered in this study are not sup- 
ported by the results. First, increases in tension level not only 
failed to facilitate learning but produced marked inhibition. 
Secondly, the form-related pattern of the tension-inducing activity 
does not appear to be a major determinant of visual form learning 
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since tracing results in recognition decrement while drawing alone 
results in improvement in recognition over the zero-induced 
tension condition. 

Evidence from this study suggests that the effect of tension on 
learning may be determined by the difficulty and coérdination 
requirements of the tension-inducing activity. On the basis of 
Bourne’s findings for verbal learning (2), no change in form recog- 
nition accuracy was expected as a function of dynamometer 
induced tension. However, the oceurrence of decrement in this 
study cannot be interpreted as evidence that the effect. of such 
tension is different for visual form learning than for verbal learn- 
ing since Bourne’s experimental procedure differed from this 
study in an important detail. Bourne required only two-second 
intervals of pressure at a 14-maximum dynamometer tension level 
as compared to the fifteen seconds of pressure from 4g to 24 levels 
in this study. Those differences in procedure and results suggest 
that the effect of dynamometer-induced tension is a function of 
the difficulty of the procedure: with limited activity, as in Bourne’s 
study, the effect is null; with extensive involvement, as in this 
study, the effect becomes interfering and hence inhibitory. This 
speculation is further supported by the occurrence in this study 
of increased response variation, indicated by dynamometer buzzer 
signals, which appears to be related to amount of elapsed time 
within an interval and to the degree of required pressure. Similar- 
ly the decrement with tracing may be related to the difficulty 
of codrdination introduced both by the precise manual control 
necessary to trace the small image, and by the fact that slight 
head movements result in visual misalignment of image and trac- 
ing. In contrast, coórdination in drawing is relatively gross and 
nonprecise. 

The facilitation with drawing over the zero-induced tension 
condition cannot logically be referred to the form-related quality 
of the tension pattern in view of the decrement with tracing. 
Instead, a striking feature unique to the drawing condition is 
Suggested as a possible determinant of the facilitation: S neces- 
sarily makes successive, discrete visual responses as he glances 
from the training form to his copy. It may be that this activity 
promotes an optimal distribution and duration of visual responses 
as compared to the zero-induced tension condition. This specula- 
tion awaits experimental clarification. 
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SUMMARY 


This experiment explores the effect of induced tension during 
learning on visual form recognition. Experimental conditions differ 
in terms of kind and amount of tension. In each training condition 
Ss learn forms which they later recognize from among similar ones. 

Recognition proficiency is at a maximum with drawing, at a 
minimum with tracing, and decreases systematically with increases 
in dynamometer-induced tension. Evidence is presented that 
suggests the inhibition phenomena are related to the coórdination 
requirements and difficulty of the tension-inducing activities. The 
facilitation phenomenon is tentatively referred to the nature of 
the visual responses accompanying drawing. 
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CORRELATION ANALYSIS OF WISC 
AND ACHIEVEMENT TESTS 


JAMES B. STROUD and PAUL BLOMMERS 


State University of Iowa 


MARGARET LAUBER 


State of Iowa Department of Public Insruction 


The Wechsler Intelligence Scale for Children (WISC) has within 
its short history established itself as a popular test. It is now being 
used extensively in school psychological work. Yet insofar as the 
authors can ascertain there has been no large-scale inve. 
of the performance of this test in school situations, J 

The primary purpose of this investigation was to determine the 
effectiveness with which all or various combinations of the WISC 
subtests could be used to predict performance on the Reading 
Comprehension, Arithmetic, and Spelling tests of the Iowa Tests 
of Basic Skills (ITBS) battery. The sample consisted of seven 
hundred and seventy-five pupils in Grades three to six. Stanford- 
Binet (S-B) I.Q.'s were also available for six hundred and menty 
one of these pupils, making possible comparisons between WISC 
and S-B as predictors of performance on the three achievement 
tests for a large sample of elementary school pupils. 

The pupils in this sample were drawn from a twenty-county area 
in Iowa. The numbers of pupils from cities, towns, consolidated and 
rural schools were roughly proportional to the distribution of pupils 
in these categories. The WISC tests were administered by Lauber 
in connection with her work as a regional school psychologist for 
the twenty-county area. The IT BS battery was administered as a 
part of the University of Iowa’s state-wide testing program. Some 
of the S-B tests were administered by Lauber, but a majority were 
administered by local school psychologists and supervisors. The 
tests were administered over a three-year period, beginning with 
the school year 1952-53. 'The ITBS battery was administered in 
all participating schools in J anuary of each year. The WISC tests 
were administered to the pupilsin this sample within three months 
(before or after) of the date on which they took the ITBS. This 
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time relationship was not observed in the case of the S-B. Such 
S-B scores as were available were utilized, regardless of the time 
at which the test was administered. 

The pupils in this sample were all “referred” by teachers and/or 
school officials for psychological interviews and examinations. All 
of them were in, or were thought to be in, some kind of school diffi- 
culty. The mean Full Scale WISC I.Q. of the girls in this sample 
was 87.3; that of the boys, 89.7; and that of both sexes, 89.0. The 
within-grades S.D. of the I.Q.'s was 14.0 for girls and 14.8 for boys, 
and the within-grades within-sexes S.D. was 14.51. The mean 1.Q. 
of the sample is considerably below average, although the S.D. 
approaches the value for the general population. The pupils in the 
sample tend to be somewhat overage for their respective grades. 
'The mean chronological ages by grades at the time W ISC was 
administered were as follows: Grade three, CA 9:1; Grade four, 
CA 10:2; Grade five, CA 11:7; Grade six, CA 12:7. Also the pupils 
in the sample were, on the average, at least a year retarded in the 
three achievement areas studied. Perhaps it is reasonable to suppose 
that this sample is representative of the population to which we 
would most commonly wish to generalize in school practice. Pupils 
presenting some kind of school problem make up the bulk of those 
receiving individual psychological examinations. 

As has just been indieated, all subjects studied were referrals. 
The problem categories and frequency of classification, as taken 
from the regional school psychologist's files, were as follows for the 
two sexes combined: Mental Retardation, sixty-three; Educational 
Retardation (not mentally retarded, but requiring remedial instruc- 
tion, especially in reading, and teacher’s assistance in structuring 
learning activities), two hundred and sixty-eight; Personality Devi- 
ations, apparently of long standing, twenty-five; Transient Per- 
sonality Deviations (although pupil may react rather violently to 
frustrations, he responds to improved environment), one hundred 
and sixty-one; Emotional Problems, evincing serious disturbance, 
one hundred and eighty-eight; miscellaneous, some showing at least 
superficial symptoms of brain damage, seventy. 


1 For a random subsample of one hundred and fifty-six the mean time 
separation for the two administrations was two years and 10.4 months. In 
every case in this subsample S-B was administered first. 
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PROCEDURE 


(1) Within-grades zero order correlations were computed, by 
sex and within-sexes, between WISC Verbal, Nonverbal, and Full 
Scale I.Q.'s and each of the achievement tests, and between S-B 
1.Q.’s and the achievement tests, 

(2) Within-grades intercorrelations, by sex and Within-sexes, were 
computed involving S. 1.Q’s and WISC Verbal 1.Q.’s, Non- 
verbal I.Q.'s, and Full Scale I.Q.'s, 

(3) Vithin-grades zero order intercorrelations were 
among WISC subtest Taw scores and the achieveme 

(4) Beta weights for a within-grades regression an 
Reading, Arithmetic and Spelling as dependent variables and WISC 
Subtest raw scores as independent variables were computed, and 
tested for significance? The nonsignificant weights were eliminated 
in turn, beginning with the weight for which the t ri 
and regression equations determined. Multiple corre] 
the achievement tests and various 
tests were computed, 

(5) In order to test the predictions of th 


computed 
nt tests, 
alysis involving 


iio was least, 
ations between 
combinations of the WISC sub- 


RESULTS 


(1) The within-grades correlations, by sex and w 


ithin-sexes, be- 
tween WISC Verbal 1.Q.’s, Nonverbal I.Q.'s, Fi 


ull Scale TOS; 


onverbal I.Q.’s and 
n Verbal I.Q.’s and 
is statistically significant. With arith- 
higher r’s than the 
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TABLE I—Wirnin Graves CORRELATIONS BETWEEN ITBS 
Scores AND Various MEASURES OF INTELLIGENCE 


| me . 

Iremales: N = 208) Males: N = 46 iin es 

Test Variables = * 
x|4|5 Alalije 
- E cass — * I 

W:V LQ. (0.50.65 0.55 0.57 0.70 0.66* 0.58 0.67 0.62 
W:NvILQ (0.67|0.55.0.36.0.620.500.62 0.630.520.00 
W:V + Nv IQ. 0.69]0.65 0.60 0. j5 0.06 0.71* 0.66 0.66 0.67 
S-B LQ. 0.59|0.58 0.51 0.59,0.63 0.66* 0.59/0.610.61 


* Difference between corresponding r's for sexes statistically significant. 
All other differences between sexes were nonsignificant. 


in the light of the time interval between administrations of S-B and 
IT BS. As noted earlier, the maximum time interval between the 
administration of WISC and ITBS was only three months. The 
approximate mean time interval between administration of S-B 
and WISC was almost three years. This average is also indicative 
of the time interval between administrations of S-B and I'TBS. 

(2) Within-grades intercorrelations, by sex and within-sexes, 
between the various I.Q.'s are given in Table II. The magnitude of 
the correlation (r — 0.94) between WISC Full Scale 1.Q.’s and 
S-B 1.Q.’s was a little greater than one might have anticipated in 
view of the time interval between the administration of the two 
tests. 

(3) Within-grades intercorrelations among WISC subtest raw 
scores and ITBS scores are shown in Table III. It may be seen 


Taste II—WrrgIN Grapes CORRELATIONS BETWEEN 
VARIOUS MEASURES OF INTELLIGENCE 


Females: N = 205| Males: N = 416 | Within Sexes: 


N = 6n 
Test Variables w w w:lw w " 
We | rs (SB Le | Be (SB) Re | NS EB 
19.| 19. | 19 i5 tc. LQ 1o. 16. LQ 
W: V IQ. 0.73/0.92/0.87 0.64 0.91.87 0.67 0.91 0.87 
W: Nv IQ. 0.93.0.85 0.90 0.83. 0.910.83 
W: FS LQ. (0.93 p.» 0.94 


garer Ednt. 


22 The Journal of Educational Psychology 


TABLE III—WiTHIN Grape IxTERCORRELATIONS: WISC anp ITBS 


(N = 775 
Test TETEL ESETE ABD nj 13 | 18 
| | 

: à Loe oe EE 
1. W: Information 0.48 0.49 0.47 0.60 0.13 0.25 0.37 0.31 0.29 0.21/0.40 0.44 0.40 
2. W: Comprehension (0.40 0.50 0.53 0.15 0.34 0.41 0.39 0.36 0. 110.43 0.44 0.41 
3. W: Arithmetic 0.39 0.46 0.20 0.28 0.30 0.26 0.31 0.21/0.40 0.70 0.50 
4. milarities 0.49 0.15 0.33 0.31 0.35 0.29 0.27 0.42 0.43 0.41 
5. W: Vocabulary 0.08 0.27 «41 0.45 0.41 
6. W: Digit Spun 0. -20 0.26 0.23 
7. W: Picture Comp. 0.34 0.36 0.28 0. 1610.29 0.20 0.28 
8. W: Picture Arrangement 0.36 0.44 0.2410. 43 0.36 0,39 
9. W: Block Design 0.57 0.45 0.51 
10. W: Object Assernbly 0.26/0.59 0.41 0.51 
11. W: Coding | 0-31 0.20 0.31 

L1 E | ie ee E 
12. ITBS: Reading Comp. | looo. 
13, ITBS: Arithmetic 10.77 

14. ITBS: Spelling | | 


that Digit Span yielded comparatively low r’s not only with the 
three achievement tests but also with the other subtests. This 
test did, however, in subsequent analyses yield significant beta 
weights’. The r's between both Picture Completion and Coding 
and the achievement tests were also relatively low. Coding, like 
Digit Span, yielded significant beta weights, but the weights for 
Picture Completion were nonsignificant. The r’s between Block 
Design and reading and between Object Assembly and reading are 
exceptionally high. These two r's are significantly larger than the 
r’s between reading and all other WISC subtests. The r’s between 
spelling and WISC Block Design, Object Assembly and arithmetic 
were significantly higher than those between spelling and all other 
subtests. The r’s between arithmetic and WISC Digit Span, Picture 
Completion, and Coding were significantly lower than those 
between arithmetic and all other WISC subtests except Picture 
Arrangement; as would be expected, the r between arithmetic and 
WISC Arithmetic was significantly higher than any of the r’s 
between this variable and all other WISC subtests. 

In general the intercorrelations among the WISC subtests for 
this sample were lower than those reported by Wechsler. However, 
the relative magnitudes (i.e., rank order) of the coefficients were 
practically the same as obtained by Wechsler. 


* The results of the regression analyses are presented in (4). 
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(4) Multiple regression analyses based on the within-grades 
intercorrelations reported in Table III were performed with ITBS 
Reading, Arithmetic, and Spelling, as dependent variables, and all 
WISC subtest raw scores, as independent variables. Beta weights 
were tested for statistical significance and the non-significant 
variables successively eliminated. The six tests which carried 
significant weights in predicting achievement in reading and spell- 
ing were WISC Arithmetic, Digit Span, Vocabulary, Block Design, 
Objeet Assembly, and Coding. In the case of arithmetic achieve- 
ment these six tests were joined by WISC Comprehension. Among 
these subtests Comprehension (in the case of the arithmetie cri- 
terion), Coding, and Digit Span were actually of negligible practical 
significance even though statistically significant. The beta-weights 
and multiple correlation coefficients for various combinations of 
WISC tests are shown in Table IV. 

It is of particular interest that for all practical purposes the 
same set of WISC tests (Arithmetic, Vocabulary, Block Design, 
and Object Assembly) are the most effective predictors of each 
achievement test investigated. Moreover, the R’s obtained between 
the least squares composites of these four subtests (raw scores) 
and each of the criterion tests were practically as high as those 
based on all eleven tests (raw scores) and the criterion tests. Also 
the r’s between the unweighted sums of the raw scores of these 


TABLE IV—MurTIPLE R’s AND BETA-WEIGHTS FOR WITHIN- 
Grapes REGRESSION ANALYSES (N = 775) 


Reading Arithmetic Spelling 

Multiple R’s 

74 .4* 273* 16 .16* .T5* 70 .60* .68* 
Inf. 8 —0.01 —0.01 0.01 
Comp. 8 0.02 0.06) 0.07 0.01 
Arith. 8 0.11] 0.12 | 0.13 | 0.50) 0.50 | 0.53 | 0.20) 0.20 | 0.23 
Sim. 8 | 0.07 0.06 0.05 
Vocab. 8 0.19| 0.22 | 0.22 | 0.09) 0.11 | 0.14 | 0.17) 0.20 | 0.21 
D.S.8 0.06, 0.06 0.10) 0.10 0.10) 0.10 
P. Com. 8 |—0.04 —0.04 —0.02 
P. Ar. 8 0.04 0.03 0.03 
D.B.8 0.25| 0.25 | 0.26 | 0.13) 0.12 | 0.14 0.20) 0.21 | 0.22 
O.A. B 0.36| 0.38 | 0.40 | 0.10) 0.11 | 0.14 | 0.26, 0.28 | 0.30 
Cod. 8 0.06| 0.07 0.06) 0.06 0.08) 0.09 


* All weights statistically significant. 
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TABLE V—WiTHIN GRADES Corre, 
BETWEEN ITBS Tests AND SELH 
or WISC Tr 


TED COMBINATIONS 


ATIONS OF Various Typrs 


N = 775 N = 775 ws | xs 
WISC Tests ehe | me | Uma |o Nas 
(Raw Scores) | (Raw Scores) | (Scale Scores) LQ.'s 
Reading 
Eleven W: tests | 0.742 0.69 0.65 0.66 
(6V + 5Nv) 
Verbal 0.52 0.59 0.58 
Nonverbal 0.66 0.63 0.63 
3, 5, 6, 9, 10, 111 0.74 0.68 
3,5, 9,10 0.73 0.69 
Arithmetic 
Eleven W: tests (6V + 0.76 0.63 0.66 0.66 
5Nv) 
Verbal | 0.61 0.68 0.67 
Nonverbal 0.53 .51 0.52 
2, 3, 5, 6, 9, 10, 11 0.76 0.63 
8,5, 9,10 0.75 0.61 
Spelling 
] 
Eleven W: tests (6V + 0.70 0.65 0.68 0.67 
5Nv) 
Verbal 0.55 0.63 0.63 
Nonverbal 0.60 0.60 0.60 
3, 5, 6, 9, 10, 11 0.69 0.65 
3,5, 9, 10 0.68 0.64 
* Verbal prorated at 5/6. 
12 = Comprehension 
3 = Arithmetic 
5 = Vocabulary 
6 = Digit Span 
9 = Block Design 
10 = Object Assembly 
11 = Coding 


Correlation of WISC and Achievement Tests 25 


zi 
TABLE VI—Comparisons BETWEEN WITHIN-GRADES Cross-VALIDATION 


R's (N = 199) anp R's ror ORIGINAL SAMPLE (N = 775) 
Criterion | WISC Tests Batos | Regea 
ITBS: Reading , 6, 9, 10, 11 | 0.69 0.74 
9, 10 0.69 0.72 
ITBS: Arithmetic 5, 6, 9, 10, 11 0.68 0.76 
9, 10 0.69 0.74 
ITBS: Spelling 6, 9, 10; 11 0.65 0.69 
9, 10 0.66 0.68 


four subtests and the criterion tests are practically as high as 
those between the unweighted raw score sums for all eleven tests 
and the criterion tests (see Table V). As a further step, I.Q.'s 
were determined for these four tests from the WISC I.Q. table 
upon the basis of 10/4 times the sum of their scale scores. The 
r between the I.Q.'s thus derived from this abridged battery and 
the LQ.'s derived from the complete battery (eleven tests) was 
0.85. The r between Verbal I.Q.'s (six tests prorated) and I.Q.’s 
based upon 5/2 times the sum of the scale scores on WISC Arith- 
metie and Vocabulary was 0.83; that between Nonverbal I.Q.'s 
(five tests) and I.Q.'s based upon 5/2 times the sum of the scale 
scores on Block Design and Object Assembly was 0.82. 

The r's between (1) WISC raw score sums, (2) WISC scale score 
sums, and (3) WISC I.Q.’s and the three achievement tests are 
given in the last three columns of Table V. To facilitate compari- 
son, the R's shown in Table IV are repeated in Column 1 of this 
table. As was noted above, it may be observed that the r’s between 
the sums of unweighted raw scores of WISC Tests 3, 5, 9, and 10 
and the three achievement tests are relatively high. This suggests 
that the school practitioner might do about as well to administer 
WISC Tests 3, 5, 9, and 10, multiply the scale score sums by 10/4 
and consult the table for conversion of full scale scores into WISC 
I.Q.'s, as to proceed in the conventional way. This, of course, is 
on the assumption that the scores made on these four subtests 
are unaffected by taking the other tests in the battery. The as- 
sumption may not be warranted. To obtain definitive evidence 


one should have to administer these four subtests as a four-test 


battery. 
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(5) Cross-validation. The data in the preceding section tell us 
how the regression equations perform for the sample from which 
they were derived. More important is the way they perform, 
once derived, for an independent sample. To ascertain this, an 
independent sample of one hundred and ninety-nine pupils was 
drawn from the same population as the original sample. Within- 
grades correlations were computed between various combinations 
of WISC tests* and the three criterion measures. The results are 
presented in Table VI. For convenience in comparison, the last 
column of Table VI shows the inultiple R's between various com- 
binations of WISC subtest unweighted raw scores and the three 
criterion tests for the original sample. Over all, only a sinall shrink- 
age resulted. The average difference between the correlations for 
the original sample and the cross-validation sample is about 0.04. 

Comments. In this investigation WISC gave evidence that its 
popularity is well-deserved. The results justify the wide use of 
this test in school situations. Some of the subtests, however, seem 
to contribute very little to the prediction of school achievement 
as measured. It would appear, therefore, that serious attention 
should be given to the possibility of developing an abbreviated 
battery. Our results suggest the use of Arithmetic, Vocabulary, 
Block Design, and Object Assembly. It is recognized that such 
an abbreviated scale would have limited use. There is no intent 
to imply that it should be substituted for the full test in all situ- 

ations. i . 

The WISC in keeping with current practice in test design pro- 
vides both verbal and nonverbal scores and subtest scores in each 
category, so as to permit differential prediction. Insofar as our 
analyses of data go, there does not appear to be a great deal of 
support for this practice with respect to the WISC tests. The 
totalscore, as represented by Binet's concept of a "general average" 
is still far and away the most revealing score, so far as prediction 
of academic achievement is concerned. 


* The least squares weights based on the original sample were used. 
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Growing interest in the integration of television into universities, 
on- and off-campus instructional programs has begun to pose many 
questions concerning the efficacy of “canned” kinescope presenta- 
tions in relation to more conventional methods. Husband (4), in 
a comparison of live TV, kinescope, and conventional classroom 
presentations of similar course material, has produced results 
indicating superior achievement for the kinescope presentation. 
However, his report does not reveal the statistical significance of 
this superiority, the relative size of his classroom, kinescope, or 
TV-viewing groups, nor statistical evidence of the original com- 
parability of hisexperimental groups. Evans, Roney, and McAdams 
(2), ina comparative study of achievement scores for groups taught 
by conventional lecture, TV-plus-correspondence, and TV-plus- 
classroom discussion methods, found no statistically significant 
differences among any of these groups. 

The present paper compares kinescope-correspondence study 
groups without an instructor, conventional classroom discussion 
experience with an instructor, and independent correspondence 
study for their relative effectiveness as methods of providing 
university-level instruction in identical subject matter (psychology 
of child development). Course effectiveness is measured before, 
during, immediately after, and approximately four months follow- 
ing the course in terms of academic achievement, certain group 
processes, and students’ attitudes toward the experience. 


SUBJECTS 


Forty university upperclassmen served as subjects. Twenty 
students enrolled in the experimenter’s developmental psychology 
course were randomly assigned to one of three groups, each of which 
received instruction according to one of the experimental treat- 
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ments described above. The kinescope group was subdivided into 
two small viewing groups. Twenty students concurrently enrolled 
in the experimenter’s educational sociology course were given no 
consistent instruction in the experimental subject matter and were 
used as controls. 


PROCEDURES 


Common to all three experimental treatments (but not to the 
control group) was the course textbook (3), a syllabus-workbook 
including fourteen written assignments to be completed and 
graded during the course, and several auxiliary manuals covering 
specific topics such as sociometry. Identical subject matter was 
thus presented in the same sequence in each group. The classroom 
group, however, met with the experimenter at regular class sessions 
twice a week, whereas the kinescope groups cach met approxi- 
mately once a week in a drawing room to view filmed TV lectures 
on the current course topics. Members of the correspondence 
group worked entirely independently, viewed no filmed presenta- 
tions, and—like the kinescope group—maintained only postal 
contact with the instructor through the fourteen written assign- 
ments and the midsemester examination. These were regularly 
corrected and returned with appropriate comments. The experi- 
menter served as instructor both in the classroom and through 
correspondence—róles in which he had had substantial experience 
prior to this study. 

The control group was compared with the experimental groups 
initially, midway, and at the end of the study for relative achieve- 
ment of the experimental subject matter. The three experimental 
groups were compared laterally for between-groups differences— 
and longitudinally for within-groups differences whenever ap- 
propriate—in seven dependent variables. ‘These, as suggested 
above, included achievement, group cohesiveness, congruence of 
course-associated opinions, the degree of perceived intragroup 
structure or interpersonal dependability, preferences among the 
three experimental conditions of instruction, composite grades for 
the course, and subjects’ evaluations of this course’s personal 
value as compared with others taken concurrently. Data were 
collected prior to active instruction, halfway through the course, 
and immediately following the course by use of standard multiple- 
choice achievement tests, a modified Libo-Schachter cohesiveness 
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questionnaire, a sociometrie test (“With whom would you most 
(least) like to discuss topics covered in the course?"), and ad hoc 
questionnaires and rating scales. In addition, a "retention" 
achievement test, sociometry, and questionnaires were adminis- 
tered about four months after the course. Analyses incorporated 
chi-square, t, and the analysis of variance as measures of the 
significance of observed differences, and C and d—where applica- 
ble—as crude measures of relationships among the independent 
and dependent variables. 


RESULTS 


Achievement. (See Figure 1.) No significant differences in achieve- 
ment appeared among any groups at the outset, nor among the 
three experimental groups’ scores on the midsemester. At the end 
of the semester, however, the difference in achievement between 
the experimental groups and the control group was significant 
at P = 0.001; whereas there remained no significant differences 
among experimental groups. Similarly, scores based on an evalua- 
tion of all written work done throughout the course showed no 
significant differences among the experimental groups. 

Four months after the course, however, a difference just barely 
significant at P = 0.05 was found which favored the correspondence 
group over the classroom group, although no significant differences 
at this time between the classroom group and the kine- 
up or between the kinescope group and the correspondence 
analysis of the net gains in achievement 
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retained by the three experimental groups four months after the 
course (“retention” test scores minus precourse test scores) yielded 
an F ratio which was definitely not statistically significant; and a 
comparison of the three groups for differences in the amount of 
post-course change in tested achievement (post-course test scores 
minus retention test scores) produced an F ratio which was simi- 
larly far from statistical significance. 

Group cohesiveness. Members of the correspondence group were 

never administered the cohesiveness questionnaire since its con- 
tents would have been meaningless to them; their actual cohesive- 
ness as a group was assumed to be nil. Cohesiveness scores were 
obtained from the classroom and kinescope groups, however, and 
means for these two groups were not significantly different at any 
point during the semester. When the classroom group’s mean was 
adjusted by eliminating one highly deviant score on the post-course 
measurement (produced by a psychiatrist's daughter) it rose 
considerably—above the mean for the kinescope group. Yet, even 
after this adjustment the difference between the two groups 
at this point remained far from significant. 

Congruence of course-associated opinions. (Sce Figure 2.) On the 
assumptions that friendly interaction among individuals tends to 
cause their opinions about the central topics of interaction to 
converge—and that this convergence of opinions is as likely to 
occur for invalid or wrong opinions as for valid opinions—measure- 
ments of the congruence of course-associated opinions were derived 
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Fıcure 2. Convergence of course-associated opinions for the three ex- 
perimental groups on the pre-course, post-course, and durability examina- 
tions. 
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for the three experimental groups from detailed item analyses of 
their initial, final, and “retention” achievement tests. These meas- 
ures were expressed as a ‘congruence ratio”: 


Sum of multiple-choice categories in which incorrect 
answers were given 
Sum of incorrect answers 


ee) 


and the ratios for the three groups were examined for longitudinal 
differences within groups and for differences between groups. 

Although a statistically significant (P < 0.05) decrease in 
congruence was indicated for the correspondence group during 
the course, no significant longitudinal changes were found for the 
other two (presumably high interaction) groups either during or 
after the course, or for the correspondence group after the course. 
Similarly, no significant differences in opinion congruence were 
found among any of the groups at any point when compared 
laterally, even though the small kinescope groups were consistently 
highest and the interactionless correspondence group was con- 
sistently lowest in opinion congruence both during and after the 
course. 

Group structure. (See Figure 3.) On the assumption that group 
members’ willingness to choose others as performing or not per- 
forming functions central to the group’s purpose is a correlate of 
their perceptions of the group’s structure, measures of group struc- 
ture or the dependability of réle expectations were derived from 
composite (positive plus negative) sociometric subject scores for 
the classroom and kinescope groups. These showed a significant 
(P = 0.05) increase during the semester for the classroom treat- 
ment and a decrease which was not quite significant at P = 0.05 
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for the kinescope treatment. Similarly, lateral or between-groups 
comparisons of the classroom and kinescope groups on this variable 
indicated no significant differences at the beginning or middle of 
the semester, but a difference significant at beyond P = 0.01 by the 
semester's end. The delayed comparison of these two groups for 
durability of effect—or retention—showed that the kinescope 
group's sociometric choices had increased and the classroom 
group's had decreased, thus bringing them closer together than at 
the end of the course in their potentials for interpersonal depend- 
ability. Nevertheless, even at this point the difference between the 
two groups in this variable remained significant at far beyond 
P = 0.05. 

A source of error in this comparison of group structure was 
introduced by the fact that for the kinescope group n = 7, while 
for the classroom group n = 6, thus consistently giving the kine- 
scope group a greater chance to make sociometrie choices than the 
classroom group. However, any adjustment for this disparity 
would tend to lower the kinescope group's or raise the classroom 
group's indicated structure, thus uniformly increasing the between- 
groups differences described above. 

Methods preferences. (See Figures 4a, b, and c.) Students? prefer- 
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dence (totally unaccustomed experience). 
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ences among the three experimental instructional treatments—as 
measured both prior to and immediately following the course—bore 
a stable, highly significant positive relationship to the treatments 
to which they had actually been arbitrarily assigned. (At both 
timesC = + 0.80, 6 = + 0.86; P = 0.001.) Almost all individual 
statements of preference which deviated from this tendency were 
accounted for by members of the kinescope and correspondence 
groups who preferred the classroom experience, although this 
countertendency progressively decreased throughout the course 
for the correspondence group and tended to increase during the 
course for the kinescope group. 

Four months after the course this high relationship remained 
stable and substantially unaltered (C = +0.78, P? = +0.82; 
P = 0.01), although some reduction in reliability resulted from 
the use of combined measures for the analyses of pre- and post- 
course preferences! while simple measures were used in the analysis 
of retained preferences. Once again, almost all countertendency 
to the obtained relationship was contributed by an equal number 
of students in the correspondence and kinescope groups who stated 
preferences for the classroom treatment, thus stabilizing the per- 
formance of these two groups on the immediate post-course 
measurement and ending their respective trends toward and away 
from their own experimental treatments. 

Course evaluations. The students’ immediate post-experimental 
evaluations of this course in comparison to others taken concur- 
rently revealed that on a three-point scale from “more valuable” 
(1) to “less valuable" (3) the mean ratings were 1.0 for the class- 
room group, 1.7 for the kinescope group, and 1.4 for the correspond- 
ence group. 

Analysis of the relationship between these evaluative ratings 
and the subjects! experimental treatments—arranged in order of 


1 On the assumption that the mid-course measurements would serve as 
the best estimate of the mid-point of change during the course, the pre- and 
post-course analyses of methods preferences were based on n* of 
thirty rather than of twenty. These were obtained by using pre-course meas- 
ures plus one-half of the mid-course measures for the pre-course analysis, 
and by using post-course measures plus one-half of the mid-course measures 
for the post-course analysis. Additional tests were run with uncombined 
data to determine the amount of spurious correlation introduced by this 
multiple use of individuals? preferences, and no substantial amount was 


found. 
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most (classroom) to least (correspondence) similarity to their 
accustomed educational backgrounds— yielded a small positive 
relationship (C = +0.60, & = +0.56) which was not significant 
at P — 0.05. Four months later these ratings had not significantly 
altered (classroom group M = 1.2; kinescope M = 1.7; correspond- 
ence M = 1.7), nor had the relationship between them and the 
students' instructional treatments changed remarkably (C — 
+0.44, $ = +0.38; P > 0.50). However, a tendency appeared for 
students in all but the kinescope group (which was rated lowest 
immediately after the course) to lower their ratings slightly in the 
direction of “about average value." 
CONCLUSIONS 

Throughout the following discussion it should be borne in mind 
that the findings presented above were based upon experimental 
manipulations which involved never more than forty subjects. 
Thus, even though subjects were randomly distributed among 
experimental groups and experimentally produced differences 
were usually tested by the potent F ratio, any interpretation of 
these findings may not inspire the degree of confidence which would 
be possible if a larger N had been subjected to similar procedures, 

Therefore, to the extent that we can consider these findings a 
reliable representation of adult students’ reactions to the three 
experimental treatments it may be concluded that: 

(a) None of the three instructional methods is significantly 
more effective than the others in producing terminal achievement. 
All of the methods, as might be expected, are better in this respect 
than exposing the student to no systematic instruction at all. 
In addition, a literal interpretation of this analysis would indicate 
that the post-instructional retention of pure factual achievement 
tends very slightly to favor the correspondence method—and 
perhaps the kinescope method—over the classroom treatment. 
(See Figure 1.) Thus, if one is willing to consider pure tested achieve- 
ment an adequate index of the desired oulcomes of instruction, it 
tentatively appears that kinescope (or television) instruction is as 
effective as class diseussion methods for the teaching of abstract 
or highly verbal subject matter. Furthermore—and this may be of 
interest to those contemplating the expense of educational TV 
transmission—it seems that independent correspondence study is 
at least as effective in this respect as kinescope instruction, and 
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probably more so than the classroom. Some may feel that this 
finding suggests an expedient solution to the problem of over- 
crowded classrooms and understaffed teaching faculties. 

The reason for this superiority of the correspondence method 
may not be hard to find. Certainly the correspondence group in 
particular had no opportunity to be distracted from the factual 
content of the text (on which the examination was largely based) 
by the “interesting implications,” “personal applications,” and 
group interaction phenomena with which the class members were 
bombarded. On the other hand, a close examination of the data 
indicates that this significant difference in retained achievement 
may be due in part to a progressive spreading of group achievement 
means which were already separated (though not quite signifi- 
cantly) at the outset. The fact that there were almost no differences 
among the groups’ nct gains in achievement during the course and 
among their post-course losses in achievement would tend to 
support this latter interpretation. 

Finally, one might argue that tested achievement, alone, is a 
meager measure of the total desired outcomes of instruction. Such 
an argument would lead to a comparative examination of personal 
satisfaction, of the amount of opinion change, or of certain group 
interaction variables before a final conclusion could be drawn 
concerning relative instructional effectiveness. 

(b) The amount of group structure, cohesiveness, acceptance 
of others’ opinions, and certain other presumed outcomes of group 
interaction (see Figures 2 and 3) show no statistically reliable 
differences between the classroom and kinescope treatments during 
the course except for what appears to be a very reliable (and 
conservative) indication that the classroom-with-instructor method 
tends to produce group structure or dependable réle expectations 
among members at a faster rate and to a higher degree than does 
the instructorless kinescope method (Figure 3). And this differ- 
ence is quite durable; for even though a convergent or regressive 
effect is produced by the lapse of time following active instruc- 
tion, the classroom group still retains significantly more réle 
interdependability than the kinescope groups after approximately 
four months of separation. 

It could be argued that the classroom group's perceived struc- 
ture is narrowly related to the course’s formal content, and that 
if the kinescope group met as many hours as the classroom group, 
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or if this variable were measured by a sociometric criterion broader 
than “With whom would you most (least) like to discuss topics 
covered in the course?” the obtained superiority of the classroom 
group might be nullified or reversed. Such a finding, in fact, would 
be much easier to reconcile theoretically with the kinescope group’s 
slight superiority in both cohesiveness and opinion convergence. 
It would rectify the appearance given here of a negative relation- 
ship between these two variables and the structuredness of group 
members’ réle expectations. Nevertheless, even though the class- 
room instructor may deliberately establish channels for inter- 
communication on matters predominantly concerning the course’s 
formal content—whereas members of the instructorless group 
might spend more of their time in attempting to establish a more 
universally applicable set of interpersonal róle expectancies—it 
would seem that the academic purposes of any instructional group 
which perceives itself as such must remain the hub of both intra- 
group communication and of the róle expectancies which become 
central to the group's processes. In addition, the theoretical objee- 
tion raised above is considerably weakened when one recalls, 
first, that the omission of a single highly deviant cohesiveness 
score raises the classroom group substantially above the kinescope 
group on this variable, and second, that bet ween-groups differences 
in both cohesiveness and opinion convergence were not statistically 
significant either before, during, or after the course. To the objec- 
tion that the meeting hours of the kinescope group were considera- 
bly fewer than those of the classroom group it may be answered 
that this disparity of “contact hours" has been a common char- 
acteristic of the kinescoped or televised reproduction of a college 
course, and that this comparison is still, therefore, a valid one. 
Thus the classroom treatment's superiority as a method for produc- 
ing durable interpersonal róle expectancies or perceived structure 
appears to stand up under both logical and statistical analyses. 
A final note concerning the rough validity of these group inter- 
action measures is that they appear to be supported by the inter- 
actionless correspondence group's significant decrease in opinion 
convergence throughout the course, and its position as the lowest 
of the three groups in this variable by the end of active instruction 
(Figure 2). Following the course, when contact was also lost among 
members of the formerly “high interaction” classroom and kine- 
scope groups, all three groups’ measured opinion convergence 
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rose—probably as a result of practice effect.2 However, the cor- 
respondence group remained consistently below the others in this 
variable even though its post-course rate of gain was greater. 

(c) Students’ preferences among instructional methods appear 
to be mightily conditioned, first, by what they are immediately 
involved in, and secondly, by what they have become accustomed 
to in the past (Figures 4a, b, and ce). In all cases the factor of 
immediate involvement is the most powerful determinant of 
methods preferences. Thus students in all three experimental 
groups stated a preference for the treatment to which they had 
been arbitrarily assigned far more than for any other treatment or 
treatments. However, where the immediate experience differs in 
relatively small degree from the accustomed instructional pattern 
(as in the kinescope group) the initial appeal of novelty soon 
diminishes for some students and the factor of custom ultimately 
becomes more important in determining preferences. Conversely, 
to the extent that the immediate experience is more essentially 
different from customary patterns (as in the correspondence 
treatment) this experience appears to become incomparable with 
these patterns—to be evaluated according to new standards; and 
the factor of immediacy or present involvement becomes relatively 
more important. Where the factors of immediacy and custom are 
working in the same direction (as in the classroom treatment) 
there emerges an apparently unanimous preference for the im- 
mediate experience as opposed to any alternatives. 

The stability of this relationship appears to be great. For once 
these two factors—custom and immediacy—have interacted and 
established a balance, this balance appears to resist change long 
after the experiences which produced it have ceased to exist. It 
may be that the relative appeal of these two factors is determined 


2 The same sixty-item, four-alternative, multiple choice achievement 
test was used in the pre-course, post-course, and durability measurements, 
This test as a whole probably became increasingly familiar to all 
three groups; and in addition, as knowledge of course content grew, more 
and more alternatives per item were probably immediately ruled out as pos- 
sible correct answers. Thus, during the four months’ retention period—when 
all groups again became equal in the interaction variables—the greatest gain 
in opinion convergence was made by the correspondence group, the next 
greatest by the kinescope group, and the least by the classroom group. This 
is exactly the order in which the three groups fell in tested achievement 
both at the beginning and end of the course and after the retention period. 
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by two personality syndromes which remain stable and separate 
within the student population. Here, again, may be a point of 
interest to those who debate the relative merits of TV or cor- 
respondence study as substitutes for overcrowded and under- 
staffed classrooms. 

Students’ end-of-course evaluations of its personal value to them 
also appeared to be conditioned (though less powerfully and far 
less reliably) by the perceived comparability of their groups’ 
instructional treatments to customary patterns of instruction. 
Thus, in their immediate post-experimental ratings the “tradi- 
tional” classroom group unanimously rated the course more 
valuable than others taken concurrently, while the “novel”? cor- 
respondence group’s mean rating was about halfway between 
greater and average value, and the ‘intermediate’ kinescope 
group’s mean rating was little better than “of average value.” 
The effect of time passage following an instructional experience 
appears to be a leveling, or a regression toward mediocrity in the 
student's memory. This is indicated by the durability measures 
for both the classroom and correspondence groups, although the 
previously low kinescope group made no post-course change at all. 

A final inference drawn from these findings might suggest 
that—contrary to what seems to be popular practice—the greatest 
chances for the success of educational TV (or any new instructional 
medium) may lie in the unabashed exploitation of the medium's 
intrinsic advantages—precisely, the extension of detailed visi- 
bility—rather than in the cautious aping of tried and proven class- 
room techniques. 


SUMMARY 


Kinescope-correspondence instruction apart from an instructor, 
customary classroom experience, and independent correspondence 
study were compared for their relative effectiveness as methods of 
providing university-level instruction in identical subject matter. 
Subjects were randomly assigned among the three experimental 
treatments and a control group was used. All groups werecompared 
in achievement, and comparisons were made among the three 
experimental groups, only, on congruence of course-related opin- 
ions, preferences among the methods, and subjects' ratings of the 
course's value to them. Comparisons were made between the kine- 
Scope group and classroom group, only, on cohesiveness and group 
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structure. Measurements of these variables were made prior to 
instruction, midway in the course, and after a four months’ time 
lapse. 

Throughout the course it was found that although no significant 
differences appeared among the experimental groups’ achievement, 
cohesiveness, and ratings of the course’s personal value, all three 
experimental groups achieved very significantly more and faster 
than the control group. In addition, comparisons of the amount of 
congruence of content-related opinions indicated a significant 
decrease in congruence for the correspondence group, but no signifi- 
cant changes for the other two treatments. Measures of the degree 
of group structure showed a significant inerease during the semester 
for the classroom treatment and a not quite significant decrease 
for the kinescope group; thus, comparisons of these two groups at 
the end of the course revealed a highly significant difference be- 
tween them. The subjects’ preferences among experimental treat- 
ments bore a highly significant positive relationship to the treat- 
ments to which they had been assigned; however, almost all 
preferences which deviated from this tendency were accounted 
for by the instructorless groups who preferred the classroom 
experience—the correspondence students showing greater prefer- 
ence for their own group, and the kinescope students tending 
to show greater preference for the classroom. Subjects’ ratings of 
the course’s personal value were highest for the classroom group 
and lowest for the kinescope group; and they bore a nonsignificant 
positive relationship to the experimental treatments arranged 
in the order of most (classroom) to least (correspondence) simi- 
larity to customary educational methods. 

Four months after the course the correspondence group had 
retained significantly more than the classroom group in tested 
achievement, and the classroom group remained significantly 
higher than the kinescope group in group structure. In addition, 
the same highly significant positive relationship was found between 
subjects’ preferences among experimental treatments and the 
treatments to which they had actually been assigned that had 
appeared throughout the course—the deviates from this trend 
being distributed between own group and classroom group in the 
same manner as at the end of the course. 

Within the limits imposed upon generalization by the small 
sample used in this study, these findings are taken to suggest 
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that kinescope or TV techniques are at least as effective as—and 
independent correspondence study is probably more effective 
than—conventional class discussion methods for promoting durable 
factual achievement, alone, in abstract or highly verbal academic 
subjects. All of the methods, of course, are more effective in this 
regard than no systematic instruction at all. In addition, the early 
social mediation of an instructor tends to produce greater famil- 
jarity among students in the classroom than in the instructorless 
kinescope treatment; and students' preferences for their own 
instructional treatments tend to start low and increase through 
time under the unfamiliar correspondence method, whereas they 
tend to decrease with time for the partly familiar kinescope method 
and remain uniformly high under the customary classroom method. 
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THE MTAI IN AN AMERICAN MINORITY- 
GROUP SCHOOL SETTING' 


I. Differences Between Test Characteristics for 
Norm and Non-norm Populations 


JOSHUA A. FISHMAN 


College Entranee Examination Board and The City College, College 
of the City of New York 


Since the publication of the Minnesota Teacher Attitude Inven- 
tory (MTAI) (3) a number of studies have appeared touching 
upon the reliability, validity, and factorial composition of the 
scores obtained upon this instrument (1,2, 4, B IT, 14). Future 
users of this inventory have been counseled to regard it as still 
in an experimental stage due to the fact that many questions con- 
cerning the criteria which the test predicts, the interpretation of 
its scores, and the stability of its norms remain unanswered. Each 
large-scale use of this instrument must, therefore, be treated as a 
research undertaking which attempts to secure partial answers to 
the above questions in a limited setting. By sharing their insights 
into the functioning of this instrument, investigators will finally 
be able to evaluate it properly. The present study is an attempt to 
investigate certain MTAT test characteristics in a sample drawn 
from the teacher population of various idealogical-structural types 
of American Jewish schools in Greater New York City. The score- 
related characteristics here under study will be (a) type of subject 
taught, (b) age, (c) sex, (d) amount of post-high school education, 
and (e) foreign vs. American birth. 

(a) In her original instrument Leeds found teachers of "special 
subjects” (e.g. art, music, ete.) to score lower than the regular 
classroom teachers (10, 12). 'This finding was also obtained with 
the normifying population (3). 

(b) Leeds also found a significant negative relationship between 


1 The writer is indebted to Dr. Irving Lorge and Dr. Isidore Chein for 
their criticisms and suggestions in connection with the original study from 
which this report is an excerpt, although he alone assumes full responsibility 
for the data and interpretations here presented. 
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inventory scores and age (10, 12). However, her collaborators in 
the normifying study ruled the age factor out statistically by 
deleting all items which originally showed sharp age differences (3). 

(c) The MTAI norms for elementary school teachers are based 
entirely on females, due both to the small percentage of males to be 
found generally in American public elementary education, as well 
as to the particular unavailability of males in the grades during 
wartime. Nevertheless, Leeds mentions “tendencies foi male 
teachers to score lower than females" (12). a 

(d) The MTAI Manual reports that in the normifying popula- 
tion of over seventeen hundred cases the “amount of post-high- 
school education was significantly and positively related to teacher 
attitudes in graded elementary and high schools" (3). " 

(e) Finally, in arriving at the norms reported in the MTAI 
Manual, no foreign-born vs. American-born comparisons were 
made. Leeds, however, comments that she “also noted a tendency 
for foreign-born teachers to score somewhat lower than American 
born” (12). 

In this study an attempt will be made to trace the relationship 
between these five test characteristics (as obtained from the 
normifying population of American midwestern public school 
teachers) and the MTAI scores of a population largely different 
from that upon which the norms were constructed. The ideological 
characteristics of four of the five subgroups into which this popula- 
tion has been divided for the purposes of stratified sampling and 
data analysis will not be delineated here as this has been done in 
detail elsewhere (7). The fifth subgroup of teachers, namely that 
labeled “Arts,” is composed of teachers employed in all of the four 
preceding school types. However, rather than being teachers of 
traditional academic subjects these teachers teach singing, dancing, 
dramatics, or arts and crafts. 

The sample upon which the following data were obtained was 
randomly selected as part of a larger, unpublished study (6) which, 
in turn, was part of a three-year survey of a score or more aspects 
of the educational activities of this American minority group. 

Due to the non-normality as well as the lack of homoscedasticity 
of the obtained Scores, the data were analyzed by Friedman’s two- 
Way nonparametric test, of analysis of variance of means by ranks 
(9), rather than by the more searching standard technique. 
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Taste I—MxgaxN MTAI Scores ny AGE AND SCHOOL TYPE 


Age Groups 
School Types 20-29 30-39 40-19 50-59 
- We 
n x n x n x n x 

1. OAD 13 | 60.1 S | 48.8 2 6.5 8 20.0* 
2. OWD, CWD 18 | 52.4 6 | 44.8| 17 | 21.4 5 |—10.2 
3. ROD 11 | 62.45 10 |40.8 7 | 49.8 2 15.0 
4. n-rSWD 1 142.0) 2 | 31.5 4 | 19.0 4 9.5* 
5. Arts 4 | 27.8t) 5 1.2 1 |(30.0) 1 (6.0) 
Totals | 47 | 64.6] 31 | 30.6] 31 | 26-8) 2 9.2 


* Includes the score of one teacher older than fifty-nine years. 
+ Includes the score of one teacher younger than twenty years. 


1. MTAI Scores by Age Group 

The data in Table I unambiguously point toward a highly 
negative relationship between age and mean MTAT scores. The 
analysis of variance of means by ranks produces a x: value of 10.68 
for the data as a whole aeross all of the school types. As xr with 
4df equals 9.5 at the 0.05 level, our analysis reveals that the differ- 
ences between the mean MTAT scores of the various groups are 
significant at this level. 

Cook, Leeds, and Callis, in the MTAI Manual report. that in 
tryout versions of this instrument the correlation between age and 
score was —0.30. When they divided their sample into two age 
groups, namely, “below forty” and “forty and above,” they found 
the former to score significantly higher than the latter (at the 0.01 
level). These investigators comment that the observed difference 
is due not only to a constant worsening of teacher attitudes toward 
children as teacher age increases, but that it is probably also due to 
the fact that many older teachers did not have the advantage of 
the recent emphasis, in teacher training, on educational psychology 
and on the understanding of child behavior." 

As stated before, age was no longer significantly related to scores 
in the normifying population due to the deletion of items which 
showed sharp age differences. That age is still significantly related 
to scores in our current sample may be indicative of greater attitudi- 
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nal differences toward children among the various age groups 
represented within the population of teachers in schools of this 
particular American minority group than was the ease for the 
normifying population. The inapplicability of the general norms for 
teachers of this minority group, at least for more than rough within 
group comparative purposes, seems clear, at least within the frame- 
work of the age factor. 

Age itself, however, is usually of sociopsychological interest as 
an index of types or degrees of exposure. We will see below that 
age is not as important a variable in itself as it is a concomitant of 
other dynamic factors imbedded in the matrix of teacher-pupil 
interaction within the schools of this minority group. 


2. MTAI Scores by Sex and American or Foreign Birthplace 

The data in Table II point toward a negative relationship 
between mean M'TAI scores and (a) male sex, (b) foreign birth- 
place. The overall difference between the sexes is somewhat slighter 
than the differences between the birthplace groups. The analysis of 
variance of means by ranks again produces a x; value of 10.68 
which, once more, is significant at the 0.05 level. The differences 


Taske II—Mean MTAI Scores sy Sex, 


BIRTHPLACE AND SCHOOL Tyres 
Amcrican-Born | Forcign-Born 
School Types Male Female | Male g Female 
n P. n X n x n i x 


3 16.7 12 69.8 


1 9 17.7 7 42.6 
2 21 45.8 11 48.0 9 —2.1 5 10.0 
3 13 48.2 14 55.0 2 30.0 1 (17.0) 
4 1 | (83.0) 1 | (42.0) 4 | —10.2 5 27.2 
5 4 11.0 6 31.8 1 (66.0) | — = 
Totals 42 39.9 44 53.8 25 6.6 18 27.8 


Birthplace totals: 
American-born, n = 86; x = 47.0 
Foreign-born, n = 43; x = 15.5 
Sex Totals: 
Males, n = 67; x = 27.5 
Females, n = 62; x = 46.3 
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between the mean MTAI scores of the various sex-birthplace 
groups are thus significant over the data as a whole. We pause, 
however, to inquire whether the four sex-birthplace combinations, 
taken separately, also differ significantly from cach other. Utilizing 
the Link-Wallace method (13), described in section 4 below, we 
find that such zs the case for the mean score differences between 
American-born females and foreign-born males or females, as well 
as between American-born males and foreign-born males. Neither 
of the within-birthplace sex-differences is significant, however. 

In arriving at the norms reported in the MTAT Manual no 
birthplace-group comparisons were made. It is highly probable, 
however, that the vast majority of American midwestern public 
school teachers upon whom this instrument was standardized were 
American-born.” 

The significant differences between the birthplace-groups can 
probably be ascribed to differences in philosophy of education 
encountered in teacher training, as well as to differences in the 
psychological atmosphere of the family, school, and total adult- 
child interaction during the teacher’s own childhood and youth. 
In addition, the birthplace category interacts appreciably with the 
age variable, as the foreign-born teachers are, generally, older than 
the American-born teachers. The average age of the former is 
forty-five years, whereas that of the latter is thirty-two years. 

The sex difference in mean MTAT score is an interesting al- 
though not entirely unexpected finding. Many boards of education 
throughout the country do not engage males to teach in the lower 
clementary grades, notwithstanding the shortage of teachers. 
This policy is due to a conviction that females are better suited to 
the teaching of young children. In addition to some possible 
difference in the degree to which males and females may under- 
standingly empathize with young children, there are probably also 
some psychological needs of young children which females can 


better meet than males. That any sex differences which may exist 
in teacher attitudes toward young children cannot be very great 


among American public school teachers in general is attested to by 


ly middle-western nature of the normifying popula- 
deduce the modal nature of other sociopsychological 
tiate between it and the current sample. 

les are not engaged to teach below the fifth 


? The preponderant 
tion also allows us to 
characteristics which differen 

3 In New York City, e.g., ma 
grade. 
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the fact that Cook, Leeds and Callis report that “men and women 
graduate students (majoring in education, with an average of 
ten years’ teaching experience) ... have mean MTAI scores which 
are not significantly different” (3). 

The significant difference between mean MTAT scores of males 
and females in our minority group sample may be explained by 
factors additional to those mentioned above. The males in our 
sample are both older (males: average age = forty; females: 
average age = thirty-three) and to a larger extent foreign-born 
(males: thirty-seven per cent foreign-born; females: twenty-nine 
per cent foreign-born). In addition, traditional Jewish education 
for males has been more demanding and more subject-centered 
than that for females. To some extent current teachers of this 
minority group may be reacting to their pupils and to their róles 
as teachers in the light of their own childhood and adolescent 
experiences with Jewish education and with the teacher-pupil 
róles considered appropriate in those times. 

These sex and birthplace differences in mean score are not 
provided for in the general norms. Some such provision is obviously 
needed for interpreting scores of samples drawn from the universe 
of this minority group’s teachers. 


3. MTAI Scores by Extent of Secular Education 


The data in Table III clearly indicate a positive relationship 
between extent of general secular (as opposed to minority-group 
sponsored) education and mean MTAI scores. The analysis of 
variance of means by ranks produces a xt value of 9.96. Since 
x; with 4df is 9.5 at the 0.05 level, the over-all differences between 
the mean MTAI scores of the various secular educational levels 
are significant at this level. 

As stated before, amount of higher education was found to be 
positively and significantly related to MTAI scores in the normify- 
ing population of seventeen hundred cases. 

In our current sample the effect of increased secular education; 
as might be expected, interacts, to some extent, with the age and 
birthplace findings discussed previously. The older, European-born 
and -trained teachers were also found to have received less formal 
education of a general secular variety than have the youngen 
American-born teachers (6). However, there is also much to be 


^ 
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Taste I1I—Mean MTAI Scores BY EXTENT OF 
GENERAL SECULAR EDUCATION 


Extent of Secular Education 
School Types NC* c B M 
- ————ÁÀ 
n x n x n = n x 
1 3 12.3 9 30.7 14 43.1 5 13.0 
2 2 —10.0 16 5.8 20 49.0 8 58.41 
3 = = 5 42.8 13 38.1 12 61.31 
4 3 —17.5 6 45.7 "-— a 2 —10.0 
5 2 4 15.0 1 (—28.0) 4 35.7 
DAMEN ME S e 
Totals 10 2.8 | 40 | 22.9| 48 42.7 | 31 53.4 


* NC = no college; C = college attended, no degree; B = bachelor's 
degree; M = master's degree. 
1 Includes the score of one Ph.D. 


said in favor of an appreciable effect for a straight “secular educa- 
tion” factor. Increased secular education most frequently signifies 
increased familiarity with concepts of modern education, educa- 
tional psychology, child development, mental hygiene, etc. On 
the whole, an increased awareness and concern for the emotional 
rights and needs of children may be expected to parallel increased 
exposure to modern views in these subject areas. 

In this respect, then, the MTA is functioning for this non-norm 
group in much the same manner as it was intended to function for 
samples much more similar to the normifying population. 

Inereased secular education may be related to mean MTAT 
scores in yet another way, namely, via increased test sophistication. 
As a fakable instrument (14) the MTAI would be most vulnerable 
to faking by those most experienced with modern objective tests, 
i.e., by those groups with the greatest amount of general secular 
education. To the extent that this relationship between faking of 
test response and higher educational attainment is present in our 
data (as well as in the norm data), to that (unknown) extent are 

hat can be said in the current connection 


our findings suspect. All t 
is that respondent-anonymity was permitted to encourage com- 


plete frankness in the filling out of this questionnaire. 
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Taste IV—MeEan MTAI Scores or Two Successive SAMPLES 
oF TEACHERS IN Five Tyres or Scinoors 


First Sample Second Sample Total 
School Types | ^ — 
€ 1 x | SD. | m s | sx n | z | S.D 
| | = - mb 

1 26 |43.9 | 45.0 | 5 |45.8|34.5| 31 |44.2 | 43.4 

2 41 | 33.6 | 47.8 | 5 | 28.4 | 53.5 46 3.0 | 48.4 

3 5 |47.6|40.2| 5 |56.6|25.7| 30 |49.1 | 63.5 

4 2 |610/22.0| 9 |101|4L8| 11 [19.9 |442 

5 | 8 |110|506| 1 |217/200| 11 |13.9 | 44.7 
Totals 102 |40.2| 44.2 | 27 |33.1|40.3 | 129 | 36.1 | 45.5 


4. MTAI Scores and Subject Matter (“Academic” vs. 
“Arts”) Taught 


From all of the foregoing tables, and especially from Table IV 
which follows, it is obvious that the teachers in group 5, i.e., the 
teachers of art, music, dancing, dramatics, or arts and crafts, 
obtained a lower mean M'TAI score than did any other group of 
teachers. This finding is in agreement with the findings reported by 
Leeds (10, 12) and with the information given in the Manual (3). 
At this point, however, we pause to carry the analysis one point 
further and ask whether the mean score of the "arts" teachers is 
significantly different from those of the means obtained by the 
other teachers. 

A somewhat roundabout attack at this problem is necessary, due 
to the regrettably unsatisfactory nature of the reply which analysis 
of variance designs give to questions of the type: “Is column (or 
row) X significantly different from column (or row) Y or Z?" (13). 

We will, therefore, first enquire whether the five groups of 
teachers differ significantly over the data as a whole. If the answer 
to this question is in the affirmative, we will then enquire as to 
which particular groups differ in total mean scores from the others. 
In this roundabout manner we can determine from which, if any, 
of the groups of teachers of academic subjects the arts teach- 
ers might differ significantly. 

To answer these questions we will utilize the data of Table IV. 
These data were originally collected in order to determine whet her 
additional sampling was necessary before embarking on the termi- 
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nal analyses of the category comparisons reported in sections 1 to 
3 above. 

The data in Table IV indicate a highly consistent rank order 
across the two samples for the mean MTAI scores of the five groups 
of teachers. The analysis of variance of means by ranks produces a 
x: value of 32.8. Since x? with 4df is 13.3 at the .01 level, the 
over-all differences between the mean MTAI scores of the various 
types of teachers may be considered highly significant. This being 
the case, we may now proceed to enquire whether the mean score 
of the teachers in group 5 departs significantly from those of the 
other four groups. 

Mosteller and Bush (/3, pp. 305-307) present a nonparametrie 
method, first proposed by R. F. Link and D. L. Wallace, for tack- 
ling such problems in two-way analyses of variance with one ob- 
servation in each cell. This method employs the range as a measure 
of variation rather than sums of squares. It has here been employed 
after converting means to ranks. The “allowance for column totals" 
obtained when employing this method turns out to be so large, 
that we are justified in “recognizing” a difference between the 
total mean score of arts teachers and the total mean of only one 
other teaching group, namely group 3. The obtained size of the 
“allowance” is large, due to the small number of rows and columns 
employed in our analysis. Thus we are forced to conclude that the 
mean score of the “arts” and other “special” teachers, although 
low indeed, is not significantly different than the mean scores of 
most of the various types of teachers of academic subjects.‘ In this 
respect, therefore, we may conclude that although the MTAI 
functions on this non-norm group similarly to its functioning on the 
norm population, the reliability of this phenomenon is not one 
which we may accept with complete confidence. 


SUMMARY AND CONCLUSION 


When the MTAI was administered to teachers from a non-norm 
universe, a number of the characteristics which this instrument 
reveals with American midwestern publie school teachers no longer 


^ Only nonsignificant differences resulted from applying to our data the 
well-known Tukey (15) test for significance of gap after performing a com- 


mon analysis of variance with one observation in each cell. 
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obtained, while other instrument characteristics continued to 

obtain. 

Those characteristics of the MTAI which obtain for the norm 
and norm-similar populations, but which did not obtain for this 
sample from a universe of teachers in schools of an American 
minority group were: (a) the constancy of MTAI scores over 
increased age. 

Those characteristics of the MTAT which obtain for the norm 
and norm-similar populations, and which did also oblain for this 
sample from a universe of teachers in schools of an American mi- 
nority group were: (a) the lower scores for teachers of “special 
subjects”; (b) the higher scores for teachers with greater amounts 
of post-high school education. 

In addition, it was discovered that mean MTAI scores varied 
significantly with two other independent variables not employed 
in the MTAI normifying studies: (a) sex, males scoring significantly 
lower than females, (b) birthplace, foreign-born teachers scoring 
significantly lower than American-born teachers. 

Although the teachers of all school types obtained relatively 
low MTAI scores when compared with the norm of American 
public school teachers, the highest over-all mean scores were 
obtained for school types in which young, American-born, females 
with advance college degrees predominate on the teaching staffs 
(type 1 and, particularly, type 3). Conversely, those school types 
in which most teachers are old, foreign-born, males with little 
formal general (secular) education obtained the lowest over-all 
mean scores.’ 

Although departing somewhat from its functional and structural 
characteristics as a measure of teacher attitudes toward children 
in the norm-population, the MTAI scores in this non-norm popu- 
lation still varied meaningfully with independent variables affect- 
ing teacher attitudes and classroom performance. 

A final question which remains as we bring these comments to 
a close is: does the MTAI continue to measure degree of teacher 
orientation toward and acceptance of the child’s emotional and 


5 In a further study it was demonstrated that of these four variables, ex- 
tent of secular education was found to be the most significantly related to 
MTAI scores (the three other variables being held constant), followed by 
sex, as the second most significant variable. The contributions of age and, 
particularly of nativity, were negligible (6). 
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developmental needs when it is administered in such a non-norm 
universe? A partial answer to this question will be found in a 
subsequent publication (8). 
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KINDERGARTEN TRAINING AND 
GRADE I READING: 


IRENE FAST? 
University of Michigan 


The influence of kindergarten training on the later development 
of the child has been a research problem since the beginning of 
the century when kindergartens were becoming increasingly wide- 
spread on this continent. The results of investigations have sug- 
gested that children with kindergarten training advance more 
rapidly in reading (4, 2, 7) and arithmetic (/), and are generally 
more advanced throughout the elementary school (2, 6). Despite 
the consistency of these findings, however, the question remains 
open because of the difficulty of conducting systematically con- 
trolled research in this area. 

Specifically, it has been impossible to obtain comparable groups 
of children with and without kindergarten training. Where at- 
tendance at kindergarten is voluntary, as it apparently was in 
all studies, uncontrolled differences in home background may 
exist between children who attend kindergarten and those whose 
parents neglect, refuse, or are for some reason unable to send them. 
Differences in age of entrance into grade I exist in at least one 
case (2), and may be present in others where no information is 
given. Finally, where children with kindergarten training and 
those without have not attended the same schools, as seems to 
be the case in the Maclatchy (5) and Brueckner (2) studies, 
differences in educational policy and administration may have 
influenced the results to an unknown extent. 

It seems clear that until investigations controlling these factors 
can be made no final conclusions can be drawn. Fortunately the 
present writer was able to make use of an existing situation ID 
which age, home background, and school environment of the 
subjects could be controlled. In the three urban schools in which 
the study was conducted children whose fifth birthdays fell in 


! This investigation was supported by Canadian Federal Health Project 


605-5-147. 
2 The writer gratefully acknowledges the assistance of Dr. R. L. Cutler 
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November and December of the current school year (1953) were 
excluded from kindergarten training because accommodation was 
limited. The following year they were admitted to grade I and 
placed in the same classrooms as first grade children who had 
received kindergarten training. Thus it was possible to examine 
differences between one hundred and thirty-four children with 
kindergarten training and forty-six children who had no opportu- 
nity for such training. No difference between the two groups 
other than age was introduced by the criterion of selection for 
kindergarten training. 

It was decided to focus the study on reading progress in the 
first grade, both because reading is generally considered to be of 
primary importance in grade I, and because standardized tests of 
reading are readily available. It was predicted that initial reading 
scores of children with kindergarten training would be higher than 
scores of children without such training, and that this advantage 
would be maintained throughout the school year. 


METHOD 


Subjects. All grade I children enrolled in the three urban schools 
were given a battery of tests. The test results of one hundred and 
thirty-four children with kindergarten training and forty-six 
children without such training were utilized in the present study. 
The test results of children who were repeating grade I and of 
those absent during one or more of the testing sessions were not 
used, 

Materials and Procedure. Following is a list of tests included 
in the battery administered to the above-described population: 

(1) The Dominion Group Test of Reading Readiness, Short 
Form, Omnibus Type B was administered to all grade I children 
in the three schools four weeks after the beginning of the fall 
term. It was given at this time in order that the children would 
have had some opportunity to become accustomed to the classroom 
situation without having had a great deal of formal reading in- 
struction. 

(2) The Survey Test in Silent Reading (First Experimental 
Edition, now being standardized) was administered at the begin- 
ning of February to discover whether differences between the 
kindergarteners and nonkindergarteners were still evident. The 
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two parts, Word Recognition and Paragraph Reading, were treated 
separately in the comparison of groups. 

(3) The Achievement Tests in Silent Reading; Primary, Grade 
I, type III; Paragraph Reading Test; and The Group Test of 
Learning Capacity, Primary, were administered in May. 


RESULTS 


The initial requirement in presenting the results of this study 
is to demonstrate that no differences in age or home background 
influenced the observed differences in reading scores. 

Seven of the one hundred and thirty-four kindergarteners had 
birthdates in November and December and had received kinder- 
garten training in other schools. There seemed to be no a priori 
reason for excluding these from the sample. Twenty-one of the 
forty-six nonkindergarteners’ birthdates were between January 
and October. These children had not been excluded from kinder- 
garten but had failed to attend, probably because their parents 
had been unable to arrange for transportation and they lived some 

distance from school (maximum five miles). To investigate the 
possibility of differences in reading ability or I.Q. between these 
children and the nonkindergarteners whose birthdates fell in 
November and December t tests were made. No significant dif- 
ference in either reading or I.Q. was found. Therefore the forty-six 
nonkindergarteners are treated as one group throughout the study. 

No difference between kindergarteners and nonkindergarteners 
in home environment was introduced by the criterion of selection 
for kindergarten training. To explore the possibility that differences 
in economie status existed between the groups, the occupations 
of the fathers of the subjects, where they were available, were 
plotted on a six-point scale published in the 1947 census data (3). 
A chi-square test showed no significant difference in economic 
status between fathers of kindergarteners and nonkindergarten- 
ers. 

Since only children who were five years of age before November 
1, 1953, were admitted to kindergarten, and most nonkinder- 
garteners reached age five in November or December, 1953, the 
kindergarteners were slightly older than the nonkindergarteners. 
Scores of kindergarteners and nonkindergarteners on all tests 
were plotted against age. Because no relation between scores and 
age within the one year period which included all dates of birth 
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Tasie I—Tue Means on Four Tests AND SIGNIFICANCE OF DIFFERENCES 
BETWEEN KINDERGARTENERS AND NONKINDERGARTENERS 
Equatep on IQ. AND M.A. 


Mean Scores Subjects Equated on 
Month of F 
Administration Test 
K Non-K LQ. M.A. 
October Reading readiness 8.67 4.03 | 0.0025 | 0.0025 


February Word recognition 13.71 11.83 | 0.028 | 0.013 
February Paragraph reading 5.45 3.96 | 0.018 | 0.005 
May Paragraph reading | 13.30 9.54 | 0.0015 | 0.035 


was found, it was concluded that age difference within this period 
did not significantly influence reading scores. 

Differences in I.Q. (scores on the Dominion Group Test of 
Learning Capacity, Primary) were controlled by matching kinder- 
garteners and nonkindergarteners. The means of reading scores 
in each interval of five I.Q. points were computed for each group 
on each reading test. The mean scores of kindergarteners and 
nonkindergarteners were compared using a one-tailed t test for 
matched individuals. The significance of differences is shown in 
Table I. 

To control both age and I.Q., the groups were then equated 
as described above, using five-month mental age intervals. As 
is shown in Table I the groups differed significantly on each test. 
The similarity between results when children were equated on 
I.Q. and M.A. scores corroborates the finding that age differences 
within and between groups did not significantly affect the reading 


Scores. 
DISCUSSION 


In general the results of previous studies (4, 2, 7) have been 
confirmed: kindergarteners achieve significantly higher scores in 
reading tests than nonkindergarteners. At least three further 
considerations determine the value of the results. 

The first concern is the generality of the results beyond the 
schools in which the study was conducted. The three schools were 
within the public education system. Because a chief area of vari- 
ability within authorized programs concerns formal reading in 
kindergarten it should be mentioned that, while explicit reading 
readiness programs were followed, reading was not formally taught 
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in the kindergartens of any of the three schools. The use of widely 
distributed, standardized reading series in the first grade suggests 
that reading experiences were broadly similar to those found 
throughout this continent. The quality of instruction was not 
unduly influenced by any single teacher because the children were 
distributed among three teachers in kindergarten and among 
nine in the first grade. These factors suggest that similar results 
would be obtained if the study were repeated elsewhere. 

The other two questions concern the extent to which similar 
differences may be found in other aspects of development and 
the length of time in the academic progress of the children that 
differences in achievement persist. At present these questions 
cannot be answered decisively. If the kindergarten reading readi- 
ness program is most closely related to reading progress in grade I 
we would not expect to find differences between kindergarteners 
and nonkindergarteners in other aspects of development. However, 
the major part of the kindergarten program is devoted to more 
general development in social skills, the ability to work independ- 
ently and in groups, and the acquisition of verbal and manual 
skills. If these are the more important factors differences between 
kindergarteners and nonkindergarteners in subjeets other than 
reading might be found in grade II or later. 

]t is hoped that these questions may be clarified by an extension 


of the study during the current school year. 


SUMMARY 
g, middle, and 


Reading tests were administered at the beginnin 
children 


end of the grade I year to one hundred and thirty-four 
who had spent a year in kindergarten and forty-six otherwise 
comparable children who had not. Group tests of mental capacity 
were administered at the end of grade I. 

Children with kindergarten training were found to achieve 
significantly higher scores on all reading tests than children without 
such training. The possibility of clarifying the degree to which 
such differences existed in other academic areas and persisted 


in later years was explored. 
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ARTHUR E. Fink, EvEnETT E. WILSON AND MERRILL B. CONOVER. 
The Field of Social Work, third edition, New York: Henry 
Holt and Company, pp. 630, 1955. 

An already good text has been greatly improved in the third 
edition. Concerning the addition of two authors, the Preface says: 
“One evidence of the acceleration as well as of the range within 
the social work field is reflected in the triple authorship of this 
revision . . . a volume that undertakes to deal with as extensive an 
area as social work now demands the thinking and the experience 
of other persons.” 

The work is organized about sixteen main heads: the problems 
that people present; the development of social services—the 
European background; social services 
almshouse to social security; social 
church to the charity organization society movement; social case- 
work; social services in a family-focused agency; services in a local 
welfare department; welfare services for children; 
social work; medical social work; the corree 
social work; social services for the aged; social group work; com- 
munity organization for social welfare; and the profession of social 
work. The chapters on the problems that people bring to social 
agencies, on social casework, and on social services to the aged 
are new. New illustrative material has been added and the bibliog- 
raphies have been brought up to date. Sources for films are indi- 
cated. 

The historical chapters give a valuable background for the 
esent-day social work. The complexity of 


in America—from the 
services in America—from the 


psychiatric 
tional services; school 


whole. 
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failed in some other work. It is something which now stands on 
its own feet and is to be respected by those who understand it. 
Its interrelations with many organizations and funds are clearly 
indicated in this volume. 

For this tightly packed and unpadded book the present reviewer 
has little of adverse criticism. But although prevention is men- 
tioned and in several places implied, it seems that more on this 
most important subject might have found place; it does not appear 
in the Index. Mental Hygiene finds some place but not the more 
modern development. under the now more common head of Mental 
Health. This most important problem of civilization might have 
had a little more attention. There is, however, considerable under 
Mental Illness. 

Case studies at the ends of chapters make a valuable contri- 
bution to the understanding of the social work problems. There are 
some footnotes, and especially excellent bibliographies at the 
end of chapters. — 

The book will be useful not only as a text but also as a reference 
book for many details in the development of social work. 

A. S. Epwarps 


The University of Georgia 
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A SECOND QUARTER CENTURY OF DELAYED 
RECALL OR RELEARNING AT 80 


L. B. HILL 


Fifty years ago, A. E. Rejall and L. B. Hill (the present writer, 
referred to later in this study as the experimenter) , then students at 
Teachers’ College, Columbia University, learned to typewrite with 
a certain degree of proficiency, under experimental conditions. The 
experiment consisted of learning to typewrite by writing & one 
hundred-word paragraph, and by copying, at the same sitting, a 
page of new material containing three hundred words. ‘The original 
data were analyzed by the late Dr. Edward L. Thorndike and pub- 
lished in the Pedagogical Seminary of December, 1918, under the 
title “Practice in the Case of Typewriting.” slesi 

Twenty-five years later, the writer, who had had very little na 
nificant practice in using the typewriter in the meantime, recalle 
the learning formerly acquired. The method of recall was the same 
às that used in the earlier experiment. The data thus secured were 
published in the Pedagogical Seminary and Journal of Genetic 
Psychology under the title “A Quarter Century of Delayed Re- 


call,” 

Now, twenty-five years after the first recall, and fifty pea 
after the first learning, the writer presents herewith certain d 
from a second quarter century of delayed recall, together with his 
own interpretation and comments upon the findings. 

gnificant to à second recall, should 


Certain facts, more or less si 
be kept in mind, such as the age of the learner and the fact that he 
i Since retirement he has done 


retired from teaching ten years 280. 
no significant work of a professional nature. He has changed en- 
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vironment and has been identified with some civic organizations 
in the community in which he has lived. He has engaged in some 
pleasurable activities such as taking care of a new home, planting 
flowers and shrubbery, and learning to play shuffleboard, to dance, 
and to play a better game of bridge. His physical condition may be 
described as good. The only real physical impairment which he 
recognizes is in eyesight which makes his reading a bit slow and 
difficult. 

The second recall of learning to typewrite was undertaken (1) 
because of a persistent interest in the learning process—an interest 
which lends pleasure in learning anything new, even at the ex- 
pense of considerable effort. (2) Since writing with a pen has be- 
come a little difficult, it will be worthwhile and pleasurable to be 
able to use the typewriter. (3) It will also be interesting to find 
out whether the experimenter, at his present age, can still learn, 
and to see what factors enter into the process. Other factors favor- 
able to undertaking the project may be mentioned as a feeling of 
well-being, living in an environment where a great many old 
people learn new hobbies, play a great deal, and enjoy life in re- 
tirement. 

Reference to Figure 1 shows that in the first experiment the 
learner wrote the one hundred-word paragraph at the rate of 7.4 
words per minute at the first sitting. (In all three experiments two 
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words were arbitrarily deducted for each error.) In the first recall 
he wrote the same paragraph at the rate of 19.8 words per minute, 
and at the beginning of the second recall at the rate of 10.8 words 
per minute. These measures taken at face value indicate that the 
experimenter, on the first day of the first recall, wrote the para- 
graph at a rate that took twenty-seven days practice to attain in 
the original experiment. In the second recall, whieh was under- 
taken twenty-five years after the first with no practice whatever 
in the meantime, and fifty years after the first learning, it took 
approximately eight days to secure the same skill. However, the 
two curves are similar throughout. Near the ends of the two recall 
curves it may be seen that the experimenter, in twenty-five to 
thirty days, was performing in a manner equivalent to that which 
had taken one hundred twenty-six days to accomplish in the origi- 
nal learning. At the end, both curves indicate that there was no 
limit to improvement in sight, and regardless of the age of the 
learner, improvement might go on indefinitely. 

In copying the three hundred-word page of new material (Fig. 
2), the story is somewhat different. In the original learning the ex- 
perimenter copied the page at the rate of 4.4 words per minute. In 
the first recall he copied at the rate of 12.4 words per minute at the 
first practice, and in the second recall, on the first day, he wrote 
the page at the rate of 6.1 words per minute. Glancing at the two 
sets of practice curves it would seem that there is considerable simi- 
larity at the beginning. The experimenter was copying at a rate 
which, in three to seven days, took four times that many days to 
accomplish at the original learning. The first recall continues to 
show considerable improvement over the first learning, but the 
early promise of the second recall soon fades out and the curve 
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almost parallels the original learning curve. Even this may be 
favorably regarded as significant considering the age of the ex- 
perimenter. He believes, however, that the lack of more favorable 
progress is due to poor eyesight which makes reading from the 
printed page laborious and slow. This difficulty probably accounts 
for the leveling-off of the second recall curve, and probably indi- 
cates a limit to improvement in the rate of copying anything from 
the printed page where reading is involved. 

Within the limits of this experiment certain observations may be 
made. The experimenter believes that age, as such, is not neces- 
sarily a deterrent to either new learning, or to recalling that which 
has formerly been learned. Physical handicaps do necessarily make 
a difference in performance. Muscular reflexes, where physical 
handicaps are not involved, perhaps may present a question for 
further investigation and study. This experimenter is inclined to 
believe that muscular reflexes in the aged, as in youth, show in- 
dividual differences. 

Some of the same factors that are present in learning at an 
earlier age are also present and effective at maturity. Interest, de- 
sire, and determination are potent factors. The will, as such, is a 
powerful stimulus. Competition against one’s self is also very ef- 
fective and may be more so in the aged than youth. While com- 
petition against rivals or competitors is a great stimulus to youth, 
the aged, as a rule, do not care who excels them. 

In the case of the present experimenter, when he discovered that 
he could recall and even excel hi 
Possessed the necessa: 


RECRUITING PROSPECTIVE RURAL TEACHERS 
AT THE MEN’S TEACHERS’ COLLEGE 
TRIPOLI, LIBYA 


IBRAHIM E. METAWEH* 


UNESCO Mission, Tripoli 


During the writer's successive contacts as & UNESCO tech- 
nician with the Libyan education officials, the problem of effec- 
tively recruiting prospective rural teachers was raised. Teachers 
and students at the Teachers’ College were included also in these 
contacts, and as a consequence, the idea of establishing an ade- 
quate instrument for recruitment was defined and clarified. The 
writer had noticed in the College several striking cases wherein 
students with serious defects had been admitted in the past, and 
since it seemed desirable to have some kind of device that would 
be effective in screening applicants, the idea of the interview test 
gradually took shape in the minds of the educational officials in- 


volved. 

The purpose would be to raise th prospective rural 
teachers admitted to the Teachers’ practical, 
objective instrument for screening applicants, namely, an inter- 
view test—one which might be standardized su 


in other Teachers’ Colleges in the country. Th es 
would be the selection of the fittest for the teaching profession irom 


among the applicants, most of whom come from rural districts. The 
procedure adopted may be of interest to those working in similar 


regions and perhaps to others 85 well. 


THE INTERVIEW CARD 
as age, previous school grades, address, name 
ed from every student applying for ad- 
d for each rater to mark the applicants 
on each of seventeen items. The items appearing on the card are 
presented below, supplemented by explanations and samples of the 
questions asked. The total number of points or marks arbitrarily 


assigned to each item and (for the main points) the number of 
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questions under each are shown in parentheses. Students scoring 
between 40 and 49, were to be referred to a revision committee com- 
posed of the Dean of the College and two teachers in the Educa- 
tion Department. The minimum total to be obtained by every 
student was set at 50 marks out of a total of 100. 


(1) Physical Aspects (30 Points, 24 Questions) 


Failure in this item alone might result in ruling out the student. 
This would happen if he has an apparent physical defect sufficient 
to hinder him from expressing himself or from performing activities 
necessary to the teaching profession, or to make him marked among 
his students. This applies to the cases where the college cannot do 
anything at present to correct the defects. In such cases, however, 
the interview is continued and the student is referred to the re- 
vision committee. Elimination of cripples is necessary as we need 
the teacher to be a Jeader too, but this does not rule out the possi- 
bility of his being accepted in another profession. 

(a) General appearance (10). Personal cleanliness and being 
well dressed without regard to the quality or price of clothing worn. 

(b) Speech and clarity (10). Notice in the students’ answers 
the fluency of speech, diction, ease in pronunciation, and form of 
sentences. 

(c) No tics and defects (10). Notice here apparent lameness, 
dwarfishness, partial but apparent, or complete deafness, cross- 
eyedness, stuttering, stammering, lisping, awkward-walking, one- 
eyedness, facial deformities, and various tics, such as exaggerated 
gesturing, excessive movement of eye lashes, winking, excessive 


movement of head or shoulder and the scratching of various parts 
of the body. 


(2) Attitudes and Personality (25 Points, 28 Questions) 


(a) Emotional balance and control (5). (The student can be 
asked questions like the following, although it is not necessary to 
confine the questions to those mentioned here.) (a) What is your 
reaction if Someone in a group does not like you? (b) What is your 
reaction if somebody differs from your opinion, and what are your 
rs toward him? (c) How would you behave if you knew 
re Y of your friends told others something about you that you 

not like? (d) If you were late to a lecture and the teacher re- 


UJ 


Recruiting Prospective Rural Teachers 71 


fused to admit you, how would you behave? (e) What would you 
do if you were to fail this test? 

. (b) Social interests (5). (a) Do you like to be alone most of the 
€ or do you get bored? Do you like to join groups? (b) Do you 
— Le e with others in organizing parties and public 
cwm ES! (c ) Do you prefer to have a pienie alone or with others? 

Ow vat extent? (d) What is the number of your acquaintances 
and friends? (e) Do you like to break the rules? 

(c) Leadership and guidance (5). (a) If you are asked to par- 
irip or social event, what róle do 
a picnic, 
or do 
f you 


ticipate in organizing a party, 
you like to take? (b) If you like to accompany others on 
do you like to have an opinion as to where you should go, 
you accept easily any place suggested by the others? (c) I 
to be in front or in the middle, or do 


walk with others, do you like 
Have you studied the life of 


you not care about your order? (d) 
a leader? Who is he, and what did you admire about him? (e) If 
you are among a group of people who are discussing some affairs, 
what is your stand? Do you just listen to others? Do you partici- 


pate in the discussion or do you guide the discussion? 
(d) Ability to comprehend and adjust (5). (a) Complete the 
following sentence: “The relationship between the ox and the 
plough is like the relationship between the locomotive and wae" 
(b) Complete the following chain of numbers: 

9-1-9-2-9-. ..—- -. 


prt M pres 
both sides (mother and father). What 
d the mother of 
alf the speed of Ahmed and 
Ahmed’s speed is one and a half as much as the speed of Elsayed. 
What is Ali's speed if Elsayed's speed is 6 kilometres per hour? 
(e) What would you do 1 ring house were on fire? 
ked to talk on à subject with 


(f) What would you do if you were aS E 
t acquainted? (g) Supposing that you were late 


which you were no 
for an important appointment where your presence Was necessary, 
how would you act? h) If you were given fifty Libyan pounds, 
what would you do? ; 

(e) Independence and self-reliance (5). (a) Do you seek the 
help of others regarding your personal problems? (b) If you find 
it difficult to understand a lesson or an arithmetic problem, what 
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figures about these states? The relationship between Libya and 
these states? Important products of these states? Their famous 
men? Kind of government? The United Nations, Specialized Agen- 
cies and Security Council? The functions of each? 


PROCEDURE AND APPLICATION OF THE TEST 


Each interviewee was rated by a committee of at least three per- 
sons from the staff of the Teachers? College, one with a scientific 
background, the second with an educational or social-studies back- 
ground, and the third with a linguistic or artistic background. 

Directions to the committee members caution them not to begin 
to question a student until they establish rapport and are sure that 
he is at ease. Students can be assured of permissiveness by talking 
to them about simple and ordinary things in a friendly way. It is 
strictly forbidden to scold a student or to be sarcastic to him during 
the interview, no matter what his answers are. He is not to be 
hurried in giving his answers or interrupted in his train of thought. 
The committee members are reminded that the students examined 
are young and in the process of growth, and that the conditions of 
their environment, their way of life, and economic status should 
be taken into consideration while examining them. It is not ad- 
uliar features of arcas that 
dequate time to reveal their 
interests and attitudes, and at the same time to guard against ex- 


hausting them, the period of the interview was held to ten or fifteen 
minutes, 


Also, it is not compul 


directions stated. They are given as examples but the test, permits 


flexibility, 
When the test interview with its four main points, one hundred 
marks 
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formed at the Teachers’ College made up of three teachers each, 
all of whom studied the test in advance. 


RETURNS AND EVALUATIONS 


Five hundred and thirty-six students applied for admission vary- 
ing in age from eleven to twenty-four, approximately three out of 
every four students coming from rural areas. The average age of 
the students who applied was sixteen years and ten months. The 
average age of the admitted students was sixteen years and nine 
and a half months. Twenty-six students were not admitted because 
they were thought to be too young or too old. (Table J.) The age 
of seventeen years was thought to be most suitable for prospective 
teachers as they will have to stay in college for four years, and by 
the time they graduate at the age of twenty-one, it is highly prob- 
able that they will be mature enough for teaching. 

Of the five hundred and ten applicants not disqualified because 
of age, 60.4 were admitted (57.5% of the total number). It may be 
that some of the more than one-third who were disqualified might 
have become successful in college and as teachers. This will ined 
be known, and only time will tell how many of those who were ad- 
mitted will be successful. Of those disqualified because of a score 
of less than 50, some extreme cases W tudents who were 


ere—ten sS : 
i rin 
extremely short, four one-eyed, seven stuttering and stamme 7 


p Nor ApMITTED 


TABLE L—AGE OF APPLICANTS ApMITTED AN 

B A Applicants Admitted Not Admitted 
gc 

i 0 10 
12 years or less S 4 A 
13 years i : i 
recens 34 21 18 
n years 149 91 58 
17 yim 181 n. s 

: 5 
18 years B “= 
19 years / i : 
20 years : : : 
21 years E i 
myers andorr Ji 

Total 536 308 228 
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cases, five lame, and twenty apparently suffering from malnutri- 
tion. 


This project is radical in recruiting intended teachers in this re- 


Since it had not been customary to obtain the exact birth dates 
of applicants nor a transcript of credits from the schools they had 
previously attended, it is as yet impossible to correlate ages and 
marks with the results from the test interview, but it is Proposed to 
correct for this deficiency in the future, Similarly to test the in- 


st, independence, and self-reliance were 
View test results coupled with the observa- 


cial attitudes, independ i 


he first use of an instrument in this country has been most 
gratifying since it Was a codperative enterprise that seemed to 
meet the needs of the situation 88 we knew it. As a result of its use 
Way, we can look forward to a gradual 
n Lybia, and hence, of its educational 


THE INTELLIGENCE TEST IN SCHOOL USE: 
SOME PERSISTENT ISSUES' 


J. B. STROUD 


State University of Iowa 


The first successful intelligence test was designed with little re- 
gard to definition or theory. Binet proceeded in an empirical fash- 
ion to ascertain kinds of tests that would discriminate among cri- 
terion groups. Kuhlmann, Terman and most of the early workers 
adopted the Binet procedure. On the other hand we are all familiar 
with various analytical approaches to the problem of intelligence 
test design. Thorndike’s area, altitude and speed; Spearman’s g and 
s factors; and Thurstone’s primary mental abilities are cases in 
point. Thus, Spearman wrote, “When a knowledge of g and s factors 
makes it possible to construct tests which measure g and s at the 
maximum, àn accurate measure for G for any individual can be 
obtained.” This, he maintained, is far better than an average de- 
rived from a miscellany of unrelated traits (7). 

Thus, the history of the intelligence test reflects two major ap- 
proaches in test design: (1) the empirical and (2) the analytical. 
In the former, the psychologist attempts to maximize differences 
among criterion groups. This is largely à utilitarian approach. In 
the analytical approach the psychologist investigates the nature 
of intelligence, with little immediate concern for the practical. Ob- 
viously it was Spearman’s hope that the two approaches would ul- 
timately amount to the same thing. As yet this hope has not been 
fulfilled. Test designers have generally resorted to the Binet 
method of maximizing differences among criterion groups. The ad- 
vancement in design is reflected largely in statistical and logical 
refinement of Binet’s general plan. Perhaps the most notable de- 
parture is the PMA battery. Here one gains the impression, from 
the somewhat limited research, that insofar as the tests of the PMA 
battery are factorially pure, most of them assess mental abilities 
not closely correlated with school achievement (11). 

Binet regarded the MA as à general average. Perhaps the gen- 


eral average is still the most significant feature of the major tests. 


————— n 
1 Presidential Address, Division of Educational Psychology, 1956. 
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However, most general tests have featured batteries of subtests 
permitting part scores and resulting profiles. This practice tends to 
insure systematic sampling of intellectual abilities and thus to im- 
prove the “general average.” Ostensibly the purpose of this feature 
is to provide for differential diagnosis and prediction. Although 
discussion of the analytical features of intelligence scores has 
sharpened within the last decade or so, the usefulness of the part 
scores in school situations still suffers from a lack of detailed re- 
search on the differential power of the constituent subtests, 

Nonverbal tests came into the picture originally as a means of 
examining persons presenting language handicaps. Most, of the 
new general intelligence tests within recent years have featured 
both verbal and nonverbal scores. Wechsler's recent and impressive 
Intelligence Scale for Children, affectionately referred to as the 
WISC, and the more recent and equally impressive Lorge-Thorn- 
dike tests are cases in point. Insofar as we know, at least insofar as 
I know, this trend in test design is dictated more by fashion than by 
empirical evidence attesting its usefulness in general school situa- 
tions. I suspect one would have some difficulty marketing group 
intelligence tests today that did not contain nonverbal features, I 
also am inclined to suspect that their worth as nonverbal tests in 
school practice has not been demonstrated by research. What the 
clinical psychologist is able to make of the results of this kind may 
be another matter. 

Most of the earlier nonverbal tests were seriously lacking in 
validity as intelligence tests, We have just completed a correlation 
analysis of the WISC on about eight hundred school children, 
grades three to six inclusive, In our results the nonverbal tests pre- 
dicted academic achievement quite as well as the verbal tests. The 
attested usefulness in school situations, however, of the Wechsler 
nonverbal tests may well stem from the fact that they are valid 
tests of intelligence, rather than from their nonverbal character. 
Quite conceivably additional verbal tests of comparable quality 
might be substituted for the nonverbal ones without detriment to 
the battery. 

„Our analysis failed to yield much in the way of differential pre- 
diction of the achievement measures used, reading, arithmetic, and 
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were better tests rather than from qualitative differences. A multi- 
ple regression analysis was performed based upon within-grades 
intercorrelations between the eleven WISC subtests and the three 
achievement measures. Beta weights were tested for statistical sig- 
nificance and least significant variables successively eliminated. 
It was found that four WISC subtests—Arithmetic and Vocabu- 
lary among the verbal tests, and Block Design and Object As- 
sembly among nonverbal tests—accounted for most of the correla- 
tion between the WISC and the achievement measures. The 
within-grades least squares multiple correlation between all eleven 
WISC tests and reading, for example, was 0.74. The within-grades 
least squares multiple correlation between these four subtests, Vo- 
cabulary, Arithmetie, Block Design and Object Assembly, and 
reading was 0.73. The correlation between the unweighted raw 
score sums of the four subtests just referred to and Reading was 
0.69. Comparable results were obtained for arithmetie achieve- 
ment and for spelling achievement. Itisa particularly striking fact 
that for all practical purposes the same four WISC subtests are 
the most effective predictors of each of the three achievement tests. 
For cross validation purposes the same kinds of analyses were car- 
ried out on a sample different from that on which the regression 
equations were derived, but drawn in the same way. The results 
obtained with the independent sample were strikingly similar to 
those obtained on the original sample (12). 

Incidentally, analyses of this kind seem to be indicated for dif- 
ferential aptitude tests. It has been suggested that in view of the 
high correlations obtained between general aptitude tests and in- 
telligence scores and also between general achievement tests and 
intelligence scores, the intelligence test is no longer required in 
school practice insofar as prediction of achievement is concerned. 
It is true that prior achievement predicts future achievement at 
least as well as does the general intelligence test, But it is equally 
true that the intelligence test which can be administered in a frac- 
tion of the time and at a fraction of the cost predicts future 
achievement almost as well. The merits of the argument must rest 
pretty largely upon the nature of the evidence of differential pre- 
diction. This brings u$ back to the intelligence score. How much 
have we improved upon Binet's “general average," beyond obtain- 
ing better averages? The value of pat scores on à test—verbal, non- 
verbal or others, depending upon the analytical properties of the 
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test—must be assessed in terms of differential prediction, actually 
substantiated by research. Perhaps it is true that the general 
achievement test and the gencral intelligence test each tells us 
something unique about the pupil and that arguments about one’s 
replacing the other could well be forgotten. However, and for the 
time being, it may be well to bear in mind that the intercorrelations 
among achievement scores in various school subjects are relatively 
high. In view of this fact it will be difficult to do a great deal with 
differential prediction. Consider a medical aptitude and a legal ap- 
titude test, for example. One has a medical aura, the other a legal 
one. But it still may be true that they could be pretty well inter- 
changed so far as their basic purposes are concerned. 

Another issue at least related to the intelligence score is the 
generality of the mental age. The mental age score, whether deter- 
mined by Stanford-Binet method or that derivable from Wechsler 
tests by multiplying the obtained IQ by CA, is, in effect, an aver- 
age score. Are some MA’s, say, of ten, better than others? Do MA’s 
of a given magnitude vary qualitatively? In our work we have 
sought to determine whether or not MA’s of a given magnitude 
vary qualitatively with the brightness of the pupils achieving them. 
A related but somewhat different question is whether or not we can 
make the same predictions relative to current school achievement 
of bright and dull pupils earning like MA’s. Of course, we would 
predict much greater future achievement for the younger pupils. 
In one investigation Kolstoe (8) formed matched pairs of bright 
and dull pupils achieving equal, estimated true Stanford-Binet 

MA’s. The mean Stanford-Binet MA was 11.3. The mean CA of the 
dull members was 15.5, giving a mean IQ of 79. The mean CA of the 
bright members was 9.3, giving a mean IQ of 126. The bright and 
dull pupils in the sample thus formed were then compared on the 
basis of the eleven WISC subtests and the PMA subtests. Except 
for PMA subtest Number, differences between the bright and dull 
pupils on the seventeen subtests were small and generally not sig- 
nificant. The dull, older pupils were far better in performance on 
PMA subtest Number. 

Ina companion investigation, Bliesmer (2) compared bright and 
dull pupils, matched in MA, on reading achievement. On reading 
subtests Recognition of Main Ideas, Drawing Inferences, and Total 


Comprehension, the bright, younger pupils excelled by significant 
amounts. 
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The school placement of the older pupils in this sample was three 
or more grades higher than that of the younger pupils. Owing to the 
hierarchical character of the arithmetic curriculum, the younger 
pupils had had no instruction in some of the operations which the 
older pupils had studied. This may account for some or all of the 
superiority of the older pupils in Number. One might think, a 
priori, that young, bright pupils would surpass (in achievement) 
older, dull pupils of comparable MA’s. They are likely to be better 
motivated, they have been expected to do well in school, they are 
more likely to be accustomed to success. In general they have had 
better early home training and have had more encouragement and 
stimulation at home. Of course, these advantages may be offset 
by the fact that at the age level at which the work just mentioned 
was done the older pupils had spent twice as many years in school 
as the younger ones. 

At the present time we are witnessing a growing interest in the 
education of gifted pupils. This is obliged to sharpen interest in the 
nature of the mental abilities of the gifted, especially the highly 
gifted. Perhaps we would be inclined to infer from the results of 
the work on the generality of the Mental Age that ten-year-old 
pupils earning MA’s of fifteen would behave on such tests as the 
WISC and the PMA much as fifteen-year-old pupils of average in- 
telligence. 

The work on the mental age concept just referred to certainly 
suggests that for the age ranges involved, the MA as a unit of 
measurement possesses a high degree of generality. This is not to 
defend the Stanford-Binet procedure of designing intelligence tests. 
1 do mean to defend the MA as a highly useful unit of measure- 
ment. The Wechsler method of designing intelligence tests, while 
it may not produce a superior result, appears to be more efficient 
and more economical. 

The logie of the argument that the MA as a unit of measure- 
ment suffers from known or suspected inequalities of its units is 
not impressive. The meaning and usefulness of Height Age or 
Weight Age or Dental Age are unaffected by the fact that these 
units prove to be unequal when tested by a metrical scale. A height 
age of nine is scarcely more meaningful in school work because we 
know how much this is in metrical units. A Mental Age, or for that 
matter a Reading Age, of nine is not less meaningful because we 
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are unable to say how much intelligence or how much reading 
ability it is. 

Moreover, the inequality of successive MA units, or our lack of 
knowledge regarding their equality, argues nothing with respect to 
the equality of IQ's obtained at successive age levels. The IQ 
defines a pupil's mental development in relation to the mental de- 
velopment of other pupils of like age in the general population. In 
a sense it defines the rate of mental growth. We may say that a six- 
year-old pupil who achieves an IQ of 125 develops at the rate of 
1.25 of a mental year for each calendar year. At this age his at- 
tainment equals that of the &verage pupil seven and a half years 
old. He has gained a year and a half in six years. He would be ex- 
pected to gain another year by the time he is 10. We could also 
say he is approximately 1.5 S.D.'s above the mean for his age, or 
that he exceeds about 93 per cent of the pupils of his age in the 
general population. The fact that the MA score from which the IQ 
is derived at successive ages may not be equal has nothing to do 
with a pupil’s maintaining his relative position in his age group. 

Terman’s concept of validity of the IQ implies an absence of 
systematic variations in IQ with age. Given constant means and 
S.D.’s over an age range, a given IQ must signify the same degree 
of brightness at successive age levels within this range. This as- 
sumes nothing with respect to the equality of the MA units from 
which the IQ's are derived, Unfortunately, test designers have not 
always been as careful as they should have been to publish S.D.'s 
for successive ages over the range of years covered. In the absence 
of this or comparable information we cannot tell whether or not 
an IQ of a given magnitude at one age level is strictly comparable 
numerically to one of like magnitude at another age level. 

One of the more troublesome practical problems is the com- 
parability of IQ's obtained on different tests—one might say need- 


lessly troublesome. Tt should not have been a difficult matter at all 
for test designers to 


school officials with t 
ing of numerically e 

In school work t 
nificance of fluctu 
matter whether th 
occasions. When test designers 


: ; as Terman and Wechsler have 
done, provide the 
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ily acquaint teachers with the amount of fluctuation required on a 
given test to become significant at given levels of confidence. There 
is no excuse for test authors’ not doing this. 

Traditionally our schools have been interested in several kinds 
of growth data. The variables for which we have the most satis- 
factory measures are mental growth, educational growth, and 
physical growth. It is of interest to note that these three types of 
growth possess considerable stability. Correlations between in- 
telligence scores earned at eight and at eighteen years of age run 
around 0.85, as seen, for example, in Bayley’s work (1). Correla- 
tions between measures of stature, obtained at eight and at eight- 
een years are of somewhat comparable magnitude (9), as are 
those between measures of educational achievement taken ten 
years apart. 

We also know from an abundance of research that intelligence 
scores and educational achievement scores are rather closely inter- 
correlated, and that neither of these is closely intercorrelated with 
measures of physical growth. Moreover, we can improve prediction 
of educational achievement somewhat by combining intelligence 
and past achievement scores. We do not, as the work of Gates (6) 
showed more than twenty-five years ago, improve prediction by 
combining intelligence scores with various measures of physical 
growth. 

However, the statistical method ordinarily used, that of mul- 
tiple correlation, precludes our obtaining a lower correlation than 
that obtained between intelligence alone and achievement. If, on 
the other hand, we form simple averages involving MA’s and vari- 
ous physical growth age scores, such as Height Age, Weight Age, 
Grip Age, and so on, and correlate the averages thus obtained 
with achievement, we do stand to obtain a much lower correlation. 
Indeed, if as many as five or six physical age measures are thus 
combined with MA and the resulting averages correlated with 
achievement, the coefficient should approach in magnitude the 
coefficient obtained between physical growth measures and 
achievement. 

This is not to controvert the importance of physical growth in 
the conduct of schools. Physical growth features in the health 
and well-being of children. It is probably an important factor in 
their social and emotional adjustment. Certainly it would appear 
that any serious deviations should pose personal problems to which 
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adjustment must be made. But we do not improve, but lower, 
prediction of achievement by combining physical age scores with 
MA scores. 

For about as long as I have been acquainted with the subject 
at hand, there has been some expressed concern about the effect 
of reading proficiency on group intelligence scores. To be sure this 
is primarily a practical problem. But I suspect it could be argued 
that some of us have published papers on problems that had less 
practical—or theoretical—significance. In a way it is a bit strange 
that more work has not been done on this problem, if for no other 
reason than that of allaying the concern of teachers and school 
officials. 

Some work of one of our own students on an MA thesis suggests 
that a slow reader who has good reading comprehension is penal- 
ized, so to speak, on timed group tests (10). No evidence was ob- 
tained to suggest that the pupils scoring low on reading compre- 
hension, whether fast or slow in rate, were penalized in any way. 
It seems probable that a facile reader who enjoys excellent reading 
comprehension would outperform on timed group intelligence tests, 
an equally intelligent pupil who reads slowly. Whether such an out- 
come would help or hinder prediction would depend upon the re- 
lationship between reading rate and educational achievement. One 
would suspect that the relationship is far from close. It is con- 
ceivable, as Durrell (4) pointed out long ago, that since reading 

proficiency is correlated with achievement, the presence of a read- 
ing factor in group intelligence tests may actually improve their 
predictive power. 

Obviously, we would not expect the effect of reading proficiency 
on group intelligence test scores to be the same for all such tests. 
We would anticipate that the obtained effect would, among other 
things, prove to be a function of the speed-power variable. " 

By way of closing, some brief remarks are made apropos of the 
social Status question in intelligence testing. Without doubt, the 
publieity given this question by the work of Eclls, Davis and 
colleagues (5) and the earlier work of Cattell (3) should have a 
wholesome effect upon test construction. As a group, test authors 
in the past were probably guilty of some degree of carelessness with 
respect to this problem. It is perhaps too early to attempt to 
resolve the important issues raised. The need is urgent to under- 
stand the meaning of social class differences in test scores as 
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psychological phenomena. The use of such expressions as “culture 
fair" and “culture bias” gives the impression that these differences 
are the unwitting work of test authors. I say unwitting because 
surely no one would have done this by design. Obviously, if these 
differences are chargeable to test authors, they are unimportant 
as psychological phenomena, save as they become something to 
be studiously avoided. On the other hand, we may find that the 
problem is really too big for the test author. It may turn out that 
the kinds of cultural impact associated with social class differences 
affect the course of mental development of children as well as 
their performance on intelligence tests. 

Social class differences are real differenees—substantial psycho- 
logical phenomena with which schools and society must deal. It 
may be that we cannot build valid intelligence tests which will 
not at the same time discriminate among the social classes. Or, 
if we start the other way round by designing tests which will not 
discriminate among social classes, we may find that the tests are 
poor predictors of academic achievement. 
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Extrapolating from general principles of educational psychology 
and from related experimental evidence, many educators and psy- 
chologists implicitly believe that intention to remember facilitates 
the retention of learned verbal materials. Actually all of the evi- 
dence bearing on this issue is ambiguous and inconclusive for one 
or both of two reasons: the experimental arrangements have been 
such as (1) to induce intentions to learn rather than to retain, or 
(2) to make impossible the isolation of the effects of intention on 
learning from its effects on retention. The present study was de- 
signed to overcome these interpretive ambiguities by both intro- 
ducing explicit intentions to remember for an extended period of 
time and by experimentally isolating the effects of such intentions 
from original learning. The latter result could only be insured by 
introducing intentions to remember after completion of the learn- 
ing task. 

Numerous studies have shown that intention to remember influ- 
ences the quality and longevity of retention. When subjects learn 
material with the expectation of recalling it for a designated period 
of time, recall is superior for the expected as against either a shorter 
or longer interval (2, 3, 8). Lester demonstrated that retention is 
enhanced by expectation of recall and by foreknowledge of the oc- 
currence and possible effect of interpolated materials (5). Unfor- 
tunately, however, since differential intentions to remember were 
introduced at the time of original learning, these experiments did 
not isolate the effects of intention on the quality or quantity of 
what was learned in the first place from its effects on retention 
per se. 

Another group of studies (e.g., 6, 7) has shown that retention 
is superior and retroactive inhibition less marked when practice is 


accompanied by "intent to learn" than when learning takes place 
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incidentally. That this difference is largely attributable to superior 
original learning, however, is demonstrated by the fact that it no 
longer prevails when Ss are equated for original mastery of the 
learning task (1). Heyer and O’Kelly (4), on the other hand, 
showed that learning under conditions of strong ego-involvement 
did significantly increase retention even though their ego-involved 
and control groups could not be differentiated on the basis of 
original learning outcomes. It is quite possible, however, that the 
superiority of the ego-involved group on original learning was 
masked by the inhibitory effect of intense ego-involvement on 
initial test performance. In any event, the evidence yielded by all 
of these studies can at best be indirect, because in each instance an 
explicit experimental set was induced to learn material for im- 
mediate reproduction rather than to retain it for an extended pe- 
riod of time. Furthermore, even if ego-involved intention to learn 
leads to no demonstrable quantitative superiority in original learn- 
ing, it may still affect retention by influencing certain qualitative 
aspects of the learning process. 


METHOD 


Standardization of learning material and tests of retention. The 
learning material used in this study was a passage of approxi- 
mately one thousand four hundred words on the history of opiate 
addiction. It was selected on the basis of being generally unfa- 
miliar and inaccessible to undergraduate students, but yet learn- 
able and relatively interesting. Two equivalent multiple choice 
tests on this material were constructed as follows: fifty-six five- 
choice test items were administered to a group of fifty-five under- 
graduate students in education three days after they had studied 
the passage for twenty-five minutes, Following item analysis, the 


order of items was rearranged so as to match odd and even items 
with respect to 


Influence of Intention on Retention 89 


students enrolled in four sections 
of an cducational psychology course at the University of Illinois. 
Of these, fifty-three were in the control group and forty-four in the 
experimental group. In order to minimize the possibility and conse- 
quences of communication between groups, the control group was 
handled during the Fall semester and the experimental group dur- 
ing the following Spring semester. The experiment was rationalized 
as a laboratory exercise and was performed during scheduled class 
hours. The control Ss were first instructed to read the passage at 
their normal reading speed (approximately eight minutes), and 
then to utilize the remainder of the twenty-five available minutes 
to commit the facts and concepts in the passage to memory. It was 
explained that a multiple choice examination on the material would 
be given immediately after the reading was completed. Ss were 
assured that the results on the examination would not be made 
publie and would not affect their grades in the course. They were 
promised, however, that their scores would be returned to them in- 
dividually, and that the median and frequency distribution of the 
class scores would be placed on the blackboard so that they could 
compare their own scores with the class norms. Fourteen days after 
the original learning and test, an unannounced (surprise) retest 
was given, utilizing the equivalent form of the test. Hence the con- 
trol group had no explicit intention to remember the material for 


the second test. 
During the second semester the same procedure, with one ex- 


ception, was followed for the experimental group. Immediately 
after the Ss finished the first test, they were informed that a second 
test on the same material would be given in fourteen days. This 
bly induced explicit intentions to retain 
rial for an additional two weeks with- 
quantity of original learn- 


of ninety-seven undergraduate 


announcement presuma 
the previously learned mate 
out in any way affecting the quality or 


ing. 
RESULTS 
Means and standard deviations of learning (test) and retention 
ol and experimental groups are shown in 


(retest) scores for contr rou 
Table I. Comparison of test and retest scores indicates that a sig- 


nificant degree of forgetting (P < 0.001) occurred in each group 
during the two week interval. The mean learning and retention 
scores made by the two groups were also significantly greater in all 
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TABLE I—Comparison OF CONTROL AND EXPERIMENTAL GROUPS ON 
LEARNING AND RETENTION TESTS 


Control Group Experimental Group 

Test Retest Test Retest 

Mean 20.1 13.2 19.5 13.6 

S.D. 2.53 3.06 2.56 2.58 

Corrected mean* 13.4 6.5 12.8 6.9 
Percentage retained{ 48.5 53.9 


*'The corrected means were computed by subtracting 6.7 (the mean 
Score of subjects who took the test without studying the material) from 
the group means of raw scores (number of correctly answered items). 

T Corrected means were utilized in computing the percentage of retention 
from test to retest. 


instances than the test scores made by different Ss who answered 
the test questions without prior study of the learning passage. The 
product-moment correlation between test and retest scores was 
0.45 (p < 0.01). 

Since the means, standard deviations, and variances of the two 
groups are not significantly different from each other on the learn- 
ing test, we are justified in comparing control and experimental 
groups with respect to percentage of retention from test to retest. 
The respective percentages of 48.5 and 53.9 show only a negligible 


difference in degree of retention which does not even begin to ap- 
proach statistical significance. 


DISCUSSION 

The results of the 
iew that intention t 
verbal material. In reconcilin 


Obviously, 
first place bec 
he will have 


al more thoroughly in the 
ed on it several weeks later 
er than a student who origi- 
he does not expect to be 


DW 
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retested. However, if the intention to remember is introduced after 
learning takes place it apparently has no influence on the longevity 
of the learned material. Intention to remember, in other words, af- 
fects retention by enhancing learning rather than by increasing 
the stability of existing memory traces; or, to put it negatively, 
intention not to remember material beyond the date of the examin- 
ation leads to little subsequent retention not because of any direct 
influence of this intention on memory traces but because such in- 
tentions are invariably associated with superficial and cursory 
methods of original learning. Hence, teachers who wish to increase 
the longevity of learning by maximizing intention to remember 
can only do so by inducing the appropriate intention at the time 
of original learning. Admonitions to remember once learning is 
conipleted, even if they lead to corresponding intentions in the 
student, do not alter the subsequent fate of memory traces. 

One possible qualification of these findings should be consid- 
ered. Intention to remember in our Ss might have been relatively 
weak in the absence of tangible material rewards for superior re- 
tention. High retention scores, after all, did not “count” in the 
sense of improving students! grades. On the other hand, level of 
ego-involvement was maintained at à reasonably high level by 
intrinsic desire to coóperate in a research study relevant to the 
subject matter of the course and by Ss’ incentive of scoring favor- 
ably in relation to class norms. 

SUMMARY AND CONCLUSIONS 

An attempt was made to ascertain whether explicit intention to 
remember facilitates retention of learned school materials apart 
from its influence on original learning. An experimental group of 
undergraduates studied a historical selection for twenty-five min- 
utes and was tested on same immediately afterwards by means of a 
multiple choice test. After this test, an explicit intention to remem- 
ber was induced by announcing that an equivalent form of the test 
would be given two weeks later. The same procedure was followed 
with a control group except that an unannounced retest was given. 

The two groups were not significantly different in mean learning 
scores or in the percentage of material retained from test to re- 
test. It was concluded that intention to remember facilitates re- 
tention by enhancing original learning rather than by increasing 


the stability of existing memory traces. 
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THE FACTORIAL INVARIANCE OF 
PURE-FACTOR TESTS' 


JOHN W. FRENCH 


Educational Testing Service 


Current practice in assembling batteries of pure-factor tests has 
been to select, for each factor, one or two tests that have been found 
ysis to have high loadings on that factor and low 
actorial invariance is assumed. That is, if 
nt each of a number of 


by factor anal 
loadings on all others. F 
several tests were to be selected to represe 
factors and tried out with a new group of subjects, it would usually 
be assumed that the factors would reappear clearly defined by the 
expected tests. The analysis described in this paper was carried 
out to check on the invariance of a set of pure-factor tests selected 
for possible use in making educational predictions. 

The tests making up the experimental battery of pure-factor 
tests were suggested by tests used in the studies covered in a survey 
of the factor-analysis literature (2). In general, three tests were 
selected as the purest and most reliable measures of each of thirteen 
of the best-established aptitude and general achievement factors. 

Deviating from this pattern, Reading Comprehension and Math- 
ematics Reasoning were used as measures of two factors each: 
Verbal Comprehension and General Reasoning for the former, 
Number and General Reasoning for the latter. Although these two 
are not pure-factor tests, they were included because of their 
known validity for a number of purposes. The Carefulness factor 
represents another deviation from the pattern of three tests for 
each factor. Since not much confidence was felt in the way that the 
scores for the Carefulness factor would turn out, five tests were 
tried. Four are used in the analysis; the other was eliminated, 
since cighty-five per cent of the subjects received a zero score. It 
should be pointed out that the Carefulness scores were derived 
from the tests of other factors. However, it was believed that there 
would be little distortion of the factors produced by experimental 
made to the staff of the U. S. Military 


Academy for their coöperation in scheduling and administering the tests 
and in supplying data on course grades. Without this kind of coóperation 


the study would not have been possible. 
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dependence, because the four Carefulness Scores bore no consistent 
relationship to the scores on the four tests from which they were 
derived. 


THE TESTS 


The tests are described in the order in which they were adminis- 
tered. The first four were a part of the entrance tests given five 


estimate of speededness. The word “speeded” is used to indicate 
that very few or no subjects finished. “Some speeding” indicates 
that it was estimated that twenty-five per cent to seventy-five per 
cent finished. Most of the tests in the experimental battery were 
given in two Separately-timed parts in order to make possible the 
computation of alternate-form reliability coefficients. The coeffi- 
cients corrected by the Spearman-Brown formula are given with 
the test descriptions. A full description of the test is not given when 
the name of the test or the kind of test is familiar to most readers 


(2). 


For Verbal Comprehension 
Verbal 


(1) Antonyms (Entrance). 40 items, 15 minutes, 93% finished. 
(2) Reading Comprehension (Entrance), 20 items 25 minutes. 


verbal-completion items. 40 items, 30 minutes. 88% fin- 
ished. 


For General Reasoning 
Reading Comprehension, (Described above.) 
(4) Mathematics Reasoning (Entrance). 20 items, 30 minutes. 
61% finished. This test was intended to measure both Gen- 
. eral Reasoning and Number. 
Variables 5 to 44 were Scores from the experimenta] battery. 


(5) Figure Analogies, Two parts, each 20 items, 10 minutes. 
Some Speeding. R = 0.64 
For Space 


(6) oe Figures. From four Squares containing black and 
White areas the one was to be selected which was not simply 


(7) 


(8) 
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rotated but was the mirror image of the other three. Two 
parts, each 40 items, three minutes. Speeded. R = 0.89. 
Block Counting. In pictures of large piles of similarly 
shaped blocks, the subjects were to count the shaded blocks 
that touched each numbered, unshaded block. The contacts 
between the blocks had to be inferred from the parts of the 
blocks shown. Two parts, each of five piles and 50 items, 
five minutes. Some speeding. R = 0.82. 

Revised Minnesota Paper Form Board. Series MA (Psy- 
chological Corporation). One of five patterns was to be 
selected which could be produced by putting together given 
outlined parts. 60 items, 25 minutes. Speeded. 


For Speed of Judgment 
(9) Visual Preferences. Each item consisted of a pair of some- 


(10) 


(11) 


simple line drawings or figures. The subject 


what similar, 
e score was the total num- 


checked the one he liked best. Th 
ber checked. No account was taken of the kinds of figures 
preferred. (For this test and the following two tests, ap- 
proximately every fifth item presented a very easy choice 
so that subjects responding at random could be detected. 
No such subjects were found.) Two parts, each 50 items, 
two minutes. Speeded. R = 0.88. 

isted of pairs of names of 


Social Judgment. Each item consi 
personal qualities. The subject checked the quality he 


would prefer to characterize someone with whom he had to 
associate closely. The score was the number checked. No 
s taken of the kinds of qualities preferred. Two 
two minutes. Speeded. R = 0.92. 

Size Judgment. Each item consisted of pairs of descriptions 
of objects. The subject checked the larger (assuming stand- 
ard or average sizes). The score was the number checked. 
No account was taken of the correctness of the judgments. 
Two parts, each 50 items, two minutes. Speeded. R = 0.92. 


account wa 
parts, each 70 items, 


For Perceptual Speed 


(12) 


(13) 


Picture Discrimination. Each item consisted of three sim- 
ple drawings of faces, two exactly alike, and one different 
in some respect. The subject checked the different one in 
each item. Two parts, each 56 items, two minutes. Speeded 
R = 0.89. , 
Cancellation. Each part consisted of a page of typed, ran- 
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dom, capital letters reproduced in red. The task was to 
draw an X over every A, starting at the top of the page and 
working down. The score was the number of A’s cancelled. 
Two parts, each two minutes. Speeded. R = 0.96. 

(14) Number Checking. This test is similar to the Minnesota 
Clerical Aptitude Test. Two parts, each 48 items, two min- 
utes. Speeded. R = 0.89. 

For Number 
Mathematics Reasoning (Described above.) . 

(15) Checking Addition and Division. This was an alternation 
of five items, each of simple addition and division of 1- 
and 2-digit numbers, with an answer given. The answer was 
to be checked as correct or wrong. Two parts, each 50 items, 
two minutes. Speeded. R = 0.89. 

(16) Subtraction and Multiplication. This was simple subtrac- 
tion and multiplication of 1- and 2-digit numbers in spiral, 
omnibus form with five response alternatives, Two parts, 
each 42 items, two minutes. Speeded. R = 0.91. 

For Carefulness (with these variables the signs of the correlations 
were reversed, so that high scores would represent favorable per- 
formance) 

(17) Picture Discrimination (Wrong). This was the wrongs 
score on Variable 12. R = 0.56. 

(18) Checking Addition and Division (Wrong). This was the 

wrongs score on Variable 15. R = 0.42. 

Subtraction and Multiplication (Wrong). This was the 
Wrongs score on Variable 16. 72 — 0.38. 

(20) Cancellation (Omit). This was the number of uncancelled 


A’s prior to the last A cancelled in Variable 13. 
For Visualization 


(21) Similar Rotations. 
different rotational 


(19) 


; o 
The same solid form is shown in a 
Positions in each item, labeled 1 "i á 
form, labeled 3, was to be visualized & 


speeding: 538% fi 
0.69 


(22) Folded P. aper Test. This test was similar to Thurstone’s 


" 


(23) 
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Paper Folding Test (2). Two parts, each 20 items, five 
minutes. Some speeding. R = 0.71. 
Mechanical Movements. Drawings were presented of dif- 
ferent kinds of mechanical apparatus involving gears, 
belts, eccentrics, etc. One or two items based on each draw- 
ing asked about the direction, speed, or nature of the move- 
ment within the apparatus pictured. Two parts, each 10 
items, eight minutes. Some speeding: 25% finished Part 1; 
31% finished Part 2. R = 0.68. 


For Induction 


(24) 


(25) 


(26) 


Inducing Rules. This test presented, in spiral omnibus 
form, items requiring the subject to discover & rule and 
then to answer the item by applying it. Included were 
analogies, series, and belonging items drawn from verbal, 
social studies, science, numerical, and graphic material. 
Variety was introduced to prevent the subjects from using 
a system for solving the problems. Two parts, each 24 
items, seven minutes. R = 0.72. 

Letter Sets. The set of four letters not belonging with other 
sets of four letters was to be chosen. Two parts, each 15 
items, five minutes. Considerable speeding: 7% finished 
Part 1; 20% finished Part 2. R = 0.43. 

Locations. This test was adapted from Thurstone’s Marks 
Test. Each item presented five groups of dashes with spaces 
between them. An X replaced one dash in each of the first 
four lines, following some rule for placement. The subject 
was to indicate where the X should appear in line 5. Two 
parts, each 24 items, five minutes. Speeded. R = 0.72. 


For Fluency of Expression 


(27) 


(28) 


Metaphors. This was similar to Thurstone’s Similes. The 
subject was required to complete a simile in three different 
ways. The score was the number of completions regardless 
of their quality; no entirely senseless ones were made. Two 
parts, each 10 items, three minutes. R = 0.84. 

Double Opposites. This was similar to Thurstone’s test 
of the same name. Each item presented a word and called 
for writing two words opposite to it in meaning and be- 
ginning with specified letters. The score was the total num- 
ber of correct responses. Two parts, each 25 items, four 
minutes. R = 0.72. 


: Edni. 
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(29) 
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Multiple Expressions. Part I required the subject to write 
down as many ways as he could of requesting a second 
helping of meat. Part II required the subject to write down 
as many ways as he could of asking someone for his name. 
In each case the score was the number of expressions writ- 
ten down regardless of their quality ; no senseless ones were 
made. Two parts, each four minutes. R = 0.82. 


For Aiming or Eye-hand Coórdination 


(30) 


(31) 


(32) 


Dotting. 'The subject placed a dot inside as many type- 
writer-sized capital O's as he could. The O's were irregu- 
larly spaced along crooked lines. The score was the number 
of O’s correctly marked. Very few were incorrectly marked. 
Two parts, one minute each. R = 0.92. 
Tracing-Easy. A line was to be drawn as rapidly as possi- 
ble through lines 4 inch apart having 246 -inch open- 
ings. The score was the number of lines correctly passed. 
Very few were passed incorrectly. Two parts, one minute 
each. R = 0.92. 

Tracing-Difficult. A line was to be drawn as rapidly as 
possible through lines Mo inch apart having %-inch open- 
ings. The score was the number of lines correctly passed. 
Very few were passed incorrectly, Two parts, one minute 
each. R = 0.94. 


For Motor Speed 


(33) 


(34) 


(35) 


For Mechanical Knowled 


Writing X’s. This Score was the number of X’s drawn on or 
near regularly spaced dots, Two parts, one minute each. 
R = 0.92. 

Writing “lack,” This score was the number of times the 
word “lack” could be written (or printed) on or near regu- 
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of three tools or other objects. The task was to select the 
one of the three tools which was commonly used with the 
first object. Two parts, each 15 items, five minutes. Power 
conditions. R = 0.80. 
(38) Mechanical Comprehension. This was patterned after 
the Bennett test. Two parts, each 15 items, five minutes. 
Power conditions. R = 0.64. 
Variables 39-44 were scores taken from a ninety-item multiple- 
choice information test designed to measure interest in six areas 
on the assumption that a subject who has been sufficiently inter- 
ested in an area to participate in it or read much about it will have 
picked up factual information about it. The scores are the number 
of items correct among the 15 (placed in random order) represent- 
ing each area. Total time, 50 minutes. Unspeeded. The areas cov- 
ered were: 
(39) Interest Information (Science). Common-sense science. 
(40) Interest Information (Social). Games, dancing. 
(41) Interest Information (Financial). Economics, banking, 
stock-market. 
(42) Interest Information (Athletic). Baseball, sailing, bowl- 
ing, etc. 
(43) Interest Information (Art and Music). Painting, great 
music, technical information. 
(44) Interest Information (Mechanics). Home shop tools, auto 
mechanics. 


THE SUBJECTS AND THE ADMINISTRATION 


The battery, five and one-half hours long including directions 
and administrative time, was given to about seven hundred and 
sixty plebes shortly before the end of their first summer at the 
United States Military Academy at West Point. With a ninety- 
minute history examination and a twenty-minute biographical in- 
ventory, it was given in two half-day sessions on successive days. 
In order to include some of the entrance test variables in the study, 
it was necessary to restrict the group to those who had been “Regu- 
lar” and “Validating” candidates. This left out the “Certificate” 
candidates, i.e., those Congressional appointees who are certified 
by their schools. The number of subjects with scores on all forty- 
four variables was four hundred and ten. 
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THE CRITERION VARIABLES 


Five additional variables as follows were obtained when first- 
year course grades became available in the summer of 1951: 

(45) Language Grades. Choice within quotas of French, Ger- 
man, Portuguese, Russian, or Spanish. Fundamentals of 
grammar, reading, writing, and particularly oral practice. 

(46) English Grades. Includes review of fundamentals, compo- 
sition, reading of literature with class discussion and 
themes, and speech making. f 

(47) Military Topography and Graphics Grades. Use of in- 
struments, engineering drawing, freehand and design 
sketching, map projections, descriptive geometry, and 
charts and graphs. 

(48) Mathematics Grades. Includes algebra, trigonometry, solid 
geometry, analytic geometry, and an introduction to cal- 
culus. 

(49) Aptitude for Service Ratings. Ratings on leadership po- 
tential. These are made by an officer advisor and by all 
fellow cadets in the cadet’s company including those from 
all four classes. The officer’s rating counts one-third; the 
cadets’ ratings count two-thirds, 

The product-moment intercorrelations were computed for the 44 
variables on the four hundred and ten cases. For the tests which 
were divided into two parallel parts, the score used for the inter- 
correlations was the sum of the part scores. 

For all correlations with the criterion variables the number of 
subjects was three hundred and sixty-one. This reduction was 
caused by normal attrition that took place during the first year at 
the Academy. The intercorrelations and the means and standart 
deviations may be obtained from ADI (8). Attrition at the d " 
emy during the first Year tended to raise the means and decrease th 
standard deviations a little, as might be expected. For tests given 


in two parts the alternate-form reliability coefficients correcte 
the Spearman-Brown formula are given in the diagonal of the 
able. 


THE FACTOR ANALYSIS 
Factor analysis of the 49 


: -variable intercorrelation matrix was 
carried out by means of T 


hurstone's Grouping Method (5). The 
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highest value in cach column was used as an estimate of the com- 
munality.? 

Sixteen factors were extracted before the res 
reduced to near the chance level. The groups o 
the extraction of the sixteen reference factors may 
ADI (8). 

After the sixteen factors ha 
seemed significant could be foun 
value among the residuals was 0.0 
siduals and the unrotated factor ma 
ADI (8). 


iduals seemed to be 
f variables used for 
be obtained from 


d been extracted, no pattern that 
d among the residuals. The largest 
79. The distribution of the re- 
trix may also be obtained from 


THE ROTATION 
tructure was carried out by the 


method described by Thurstone (5, p. 194), where all pairs of fac- 
tors are plotted on paper and the new positions for the hyper- 
planes are drawn on the paper, so that the direction cosines for the 
new rotation may be read from the graph. Seven sets of rotations 
were carried out, including à total of one hundred and twenty ro- 
tated pairs. A new set of graphs was assembled after each set of 
rotations. The final transformation matrix may be obtained from 
ADI (8). The final oblique solution is presented in Table I. Load- 
ings of 0.20 or higher are printed in parenthesis. Communalities de- 
rived from the unrotated matrix are given in the right-hand col- 
umn, Table II shows the intercorrelations of the factors. —— 
In the process of rotation it was borne in mind that the infinite 
number of rotational solutions simply represent alternate ways of 
explaining the intercorrelations that were found. No one solution 
thematically correct than another. While simple struc- 


is more ma : : 
ture was the general goal, this concept is 80 variously or vaguely 


defined that there is much freedom for variation within its limits. 
Rotation in this study was not done blind, but was carried out so 
that the result would be of maximum value to the broad purpose 


Rotation to oblique simple $ 


2 After the extraction of the ninth factor, the values used for the com- 
munalities were the highest values in the columns of the residual table. 
Up to the extraction of the ninth factor the highest values in the columns 
of the original table were inserted in the diagonals of the residual tables. 
Since no groups had overlapped up to this point, it made little difference 
whether the communalities of members of the groups were estimated from 
the original or the residual tables. 
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TABLE II—INTERCORRELATIONS OF THE FACTORS 


1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 

1 —.02| .05| .01| .O1| .05| .27| .35 | .13| .24| .60| .40| .31|.29 | .01|—.20 
2 |—.02 03| .42| .38| .55| .20| .58 | .10| .18| .27| .11| .33| .23| .74| .05 
3 | .05| .03 os| .17| .18|-.23| .00 |-.10|—.03| .24| .19|—.22| .37 |-.11| .14 
4 | .O1| .42| .05 .85| .54| .18|.13 |—.07| .30| .18| .07| .11|.08 | .30|—.01 
5 | .01| .38| .17| .35 34| .13|.25 |—.32| .49| .27| .07| .20|.08 | 41] .03 
6 | .05| .55| .18| .54| .34 18} .36 |-.11| .19| .24| .07| .24|.27 | .34| .07 
7 | .27| .20|—.23| .18| .13| .18 23 | .09| .15| .33| .o8|—.02| .09 | .34| .14 
8 | .35| .58| .00| .13| .25| .30| .23 .97| .10| .47| .40| .43| .48 | .48| .01 
9 | .13| .10|-.10|-.07|—.32|—.11| .09| .37 —.28| .12| .25| .20| .15 | .18|—.04 
10 | .24| .18|-.03| .30| .49| .19| .15| .16 |—.28| 31| .13| .26| .07 | .24|—.09 
11 | .60| .27| .24| .18| .27| .24| .33| .47 | .12| .31 61) .38| .43 | .41)—.15 
12 | .40| .11| .19| .07| .07| .07| .08|.40 | .25| .13| .51 30| .58 | .30|—.20 
13 | .31} .33|—.22| .11| .20| .24|—.02| .43 | .20| .20| .38| .30 os| .41|—.18 
14 | .39| .23| .37| .00| .08| .27| .09| .48 | .15| .07| .43| .58| .08 08| .16 
15 | .04| .74|-.11| .30| .41| .34| .34| .48 | .18| .24| .41| .30| .41| .08 Sali 
16 |—.20| .05| .14/—.01) .03) .07| .14] .01 |—.04—.09/—.15—.20 —.18| .16 |—.11 


of the research, that of identifying a useful set of pure-factor tests 
by proving that an expected factor structure would appear. The 
criterion of simple structure and a positive manifold of loadings 
was responsible for a great majority of the decisions made in the 
rotation, but two examples may be mentioned to show the way in 
which other conditions played a part. 

(1) This study does not reveal Space, Induction, and Visualiza- 
tion as three separate factors. Rotation could have been such that 
any two of these three factors would appear as primaries with the 
third being a combination of the other two. The final rotation was 
made to show the spatial tests as loading both Induction and Vis- 
ualization, because this situation had the readiest psychological 


interpretation and, therefore, served as the best framework for the 
development and underst 


(2) 


tors consisted of the three “ 
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factors can, therefore, be relied upon to appear and may well be 
useful dimensions in a testing program. 


INTERPRETATION OF THE FACTORS 


The factors are named to indicate their identification with fac- 


tors found by other investigators. Most of them are now included 
in the Kit of Selected Tests for Reference Aptitude and Achieve- 
ment Factors (4), which was developed by committees of investi- 
gators for the purpose of defining some of the better established 
factors. The selection of some tests for the Kit was based partly 
on the findings of this analysis. 

Following each factor name there appears the 
Number" of the factor from a numbering system 
was initiated by Cattell (1). 

Because this study was undertaken to confirm the establishment 
of familiar factors, which have been described elsewhere (2 and 3 
and original sources referred to therein), the factor deseriptions 
and the discussions are brief. 

Factor I is Verbal Comprehension, Universal Index No. T13: 
Knowledge of the English language; including the skill necessary 
both to read and to write it. 

Factor II is Speed of Symbol Dis 
No. T12: The ability to find a known symbol amidst a mass of dis- 
tracting stimuli. Tests 12, 13, and 14 were included to define the 
“perceptual speed” factor. However, “perceptual speed,” thought 


by the writer at the time of testing to be a single entity, actually 
i f at least two factors, one of which is speed of symbol 


discrimination. Another “perceptual speed” factor calls for speed 


in perceiving detail. Variable 12 is a test of this other factor. Its 
low loading seems to confirm that fact. 

Factor III is M echanical Knowledge, Universal Index No. T8: 
A knowledge stemming from instruction or experience in the field 
of mechanics. This factor is unique in this battery of so-called ap- 
titude factors, since it depends on variance in achievement. It was 
included in this aptitude battery because mechanical knowledge 
plays such an important part in our culture that it is likely to be 


of consequence in predicting success in various academic or voca- 


tional fields. 
Factor IV is Motor Speed, 


“Universal Index 
for factors which 


crimination, Universal Index 


Universal Index No. T9: Speed in 
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moving the fingers in complex patterns that are well fixed by habit. 
This may be the same as the U. S. Employment Service's Finger 
Dexterity factor (2), and seems certainly to be the same as Bech- 
toldt’s “Motor Speed” factor (2), and Woodrow's “Speed” factor 
(7). It is not known to what extent this factor may be generalized 
beyond tasks carried out with the hand holding a pencil. 

Factor V is Speed of Judgment (not indexed) : Speed or lack of 
compulsiveness in making minor decisions. The large individual 
differences on tests of this sort are probably affected rather 
strongly by the situation and the test directions. However, it seems 
reasonable to assume that a part of this variance represents a mode 
of response habitual to the individual, 

Factor VI is Aiming or eye-hand coordination, Universal Index 
No. T1: The three variables having high loadings on Factor VI 
in this study are all performed with pencil and paper. These tests 
occur in several studies in the literature without evidence that the 
factor encompasses more than the paper and pencil situation (2). 

Factor VII is the Grades factor (not indexed): The variance 
that is common to course grades but not to any tests. This variance 
probably covers a wide variety of characteristics including such 
things as classroom attitude and parental pressure. 

Factor VIII is a combination factor of Induction and Space, Uni- 
versal Index Nos. T5 and T11: Induction involves the findings of 
general concepts that fit sets of data, the forming and trying out of 
hypotheses. Space is the ability to comprehend the nature of the 
arrangement of elements within a visual stimulus pattern primarily 
with reference to the location of the examinee. The zero hyperplane 
of this factor could have been rotated to a different position with 
respect to Factor XIV, but the present rotation is the one which 
Pg deri a The Space factor appears between I oe 
This mabes Pin and lies in the plane determined by these two. 
“Spatial Reasonin o ological Sense, as suggested by the term 

Soning” so often used instead of “Space.” The three 


variables 6, 7 : " 
“Space” Ad. and 8) put into this battery to represent the 
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no consistent correlation between the carefulness scores and the 
speed scores derived from the same tests. Because this is true, there 
can be no serious distortion of the factor pattern resulting from use 
in the analysis of two scores derived from each of four tests. 

Factor X is Fluency of Expression or possibly Idea Fluency. 
(The latter would be Universal Index No. T6) : The three tests were 
placed in the battery to define Fluency of Expression, the ability, 
once an idea is given, to express that idea rapidly in many different 
ways. However, the distinction between new ideas and new ex- 
pressions of an idea is not clear and is not established here, be- 
cause no recognized test of Idea Fluency was included in the bat- 
tery. Such a test, for example, might call for writing down as many 
ideas as possible about some such topic as “a house.” The score on 
this kind of “Idea Fluency” test would be the number of ideas 
written down within a given short time limit. 

Factor XI appears to be Interest in Science (not indexed) or an 
information acquisition factor. Even if it is considered an interest 
factor, it is not known whether it, defined as it is by tests of infor- 
mation about science, is the same as that found in numerous other 
studies loaded by questions asking about preference for science. 


The Inducing Rules test includes questions involving à knowledge 
of general science, and so its loading is reasonable, but there is no 


explanation for the loading of Interest in Art. 
Factor XII is probably General Reasoning, Universal Index No. 


T34: A factor which has been associated with arithmetic reasoning 
tests and a rather varied list of other reasoning measurements. Fig- 
ure Analogies was included in the battery to help define this factor. 


Its loading of 0.18 is lower than expected, but it is the highest load- 
ing for Figure Analogies on any of the factors. 

Factor XIII is an interest or personality factor. It could be called 
Interest in People or it may be similar to the Sociability factor 
found in personality inventory analyses. Factors having both of 
these interpretations have been found in many studies (3). 

Factor XIV is a combination of Visualization and Space, Uni- 
versal Index Nos. T14 and T11: As noted for Factor VIII, the in- 
ference here is that the tests used for the “Space” factor actually 
nted by both Visualization and Induc- 


require the abilities represe 
tion. Visualization may be defined as the ability to comprehend 
objects in space 80 as to manipulate them in the imagination. 
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Factor XV is the Number factor, Universal Index No. T10: 
Speed in carrying out simple numerical computations involving the 
four arithmetical processes. Mathematics reasoning tests ordinar- 
ily have a loading on this factor as well as on General Reasoning. 
In this analysis the mathematics variables do not load on Number, 
because Number and General Reasoning were rotated to a cor- 
relation of 0.36 with each other. 

Factor XVI has no apparent psychological meaning. It can best 
be considered a residual factor. 


CONCLUSION 


Tests of each of thirteen well-established factors and some tests 
of general information were administered to an entering class at 
West Point. Together with scores from the entrance examinations 
and first-year grades, these were factor-analyzed to confirm the 
identification of useful pure-factor tests by finding factorial in- 
variance when such tests are factored. 

All thirteen of the sets of tests used in the battery for the desig- 
nation of reference factors did in fact turn out to fall into clusters 
within the factor space. Twelve out of thirteen of the clusters de- 
fined factors as expected. Those three variables intended to repre- 
sent Space did not load a separate factor for themselves but did 
form a thirteenth cluster located in the plane defined by Induction 
and Visualization. The General Reasoning factor is included among 
the twelve that were separated successfully. However, the Mathe- 
matics Test (No. 4) and Figure Analogies (No. 5) do not define the 
factor well by themselves. The factor does become adequately 
EUN or because of the presence of Mathematics Grades 
E m of the fifteen non-residual factors make a 
Nie o prise m The Grades factor has often appeare 2) 
"Tho Interest & E course grades and tests were included. 
times in the ey aeria one that has been found wich 

ure (3). Leadership ratings and amount of in- 


formation on social activiti i i 
rmai les comb 
plication here is that the "pl lisa ien write 


lead i i : 1 d 
with sociability. adership ratings are heavily weighte 


From the standpoint of the study’s purpose, it may be concluded 


that factorial invariance held up, and, therefore, the selection of 
tests to represent certain factors was successful. The clusters are 


Factorial Invariance of Pure-Factor Tests 109 


sufficiently tight to make it reasonable to represent factors in a 
battery by one test from each set. Thus, the study served as & pre- 
test yielding factorial, reliability, and level-of-difficulty data upon 
which to base the selection of tests. 


REFERENCES 


1) R. B. Cattell, A universal index for psychological factors. Advance 
Publication No. 3, Laboratory of Personality Assessments and Group Be- 
havior, Univ. of Illinois, Urbana, Ill., 1953. 

2) J. W. French, “The Description of Aptitude and Achievement Tests 
in Terms of Rotated Factors,” Psychol. Monogr., No. 5, 1951. 

3) J. W. French, The Description of Personality Measurements in Terms 
of Rotated Factors. Princeton, Educational Testing Service, 1953. 

4) J. W. French, (Ed), Kit of Selected. Tests for Reference Aptitude 
and Achievement Factors. Princeton, Educational Testing Service, 1954. 

6) L. L. Thurstone, Multiple-Factor Analysis. Chicago, Univ. of Chicago 
Press, 1947. á 

6) Ledyard R. Tucker, “A Semi-Analytical Method of Factorial Rotation 
to Simple Structure,” Psychometrika, 9: 43-68, 1944. . 

7) H. Woodrow, ^The Relation Between ‘Abilities and Improvement with 
Practice,” J. educ. Psychol., 29: 215-230, 1938. 

8) American Documentation Institute. A more detailed form of this paper 
(or extended version, or material supplementary to this article) has been 
deposited as Document number 5163 with the ADI Auxiliary Publications 
Project, Photoduplication Service, Library of Congress, Washington 25, D. G. 
A copy may be secured by citing the Document number and by remitting 
$135 for photoprints, or $1.25 for 35 mm. microfilm. Advance payment is 
required. Make checks or money orders payable to: Chief, Photoduplication 


Service, Library of Congress. 


RETENTION OF GENERAL PSYCHOLOGY 
W. J. McKEACHIE and DANIEL SOLOMON 


University of Michigan 


Whenever a group of teachers gather to commiserate about the 
futility of trying to educate the average college student, someone 
is sure to say, “Within a week after the final examination the 
students will have forgotten three-quarters of what they’ve 
learned anyway!” 

Such statements, based on Ebbinghaus’s classic studies of 
memory have seldom been tested in terms of the amount of gain 
in knowledge during a course retained over a period of time 
following the course.! Ebbinghaus, himself, discovered that mean- 
ingfulness made for improved retention. We might, then, expect 
retention to vary somewhat from course to course as well as 
individual to individual. 

If an instructor is teaching an advanced course, how much 
should he expect students to remember from previous courses? 
This study had the practical objective of determining how much 
of students’ learning in a course in general psychology was re- 
tained over a summer or more. 


Procedure 


. At the University of Michigan a course in general psychology 
1$ à prerequisite for the course in educational psychology. In the 
fall of 1954 most of the students in the educational psychology 
course had taken the general psychology course either a year 
earlier or the preceding spring. Accordingly, a forty-item multiple- 
choice test was constructed from items on the general psychology 


fin inati j 
= tira for those two semesters. This test was given 
e i i 
nts in the educational psychology course during the first 
*Most studies of rete: 
res on an examinatio 
test at some later dat 
Students migh 
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TABLE I.—PERCENTAGE OF CORRECT ANSWERS TO 
EXAMINATION ITEMS 


End of | Besinning 
T— | eee Education-| Per cent 
EY al ERN Retention 

ogy Course 
Course |Sept., 1954 


19 items common to June, 1954, and) 54.03 73.5 70.7 85.6 
Educational Psychology tests 

18 items common to Feb., 1954, and| 41.12 
Educational Psychology tests 


78.0 TAL 81.3 


week of the fall semester. Thirty-five of the students entering 
the educational psychology course had also taken the final the 
previous June. There were nineteen items common to the two 
tests. Eighty-one students had taken the final the previous Feb- 
ruary. There were eighteen questions common to these two tests. 
In June the mean score on the nineteen common items was 
14.0. In September the same students averaged 13.4. While test 
papers were not available for the February examination, item 
analysis data gathered for all students in the course indicated 
that the mean on the eighteen common items had been 14.0, while 
the mean score of eighty-one of these students in educational 
psychology eight months later was 12.8. Since students entering 
education are on the average no better than the average student 
in our course, this too indicates a high degree of retention. 
This was à much higher degree of retention than we had an- 
ticipated. We began to wonder whether or not our results could 
be explained by the hypothesis that our test scores were not 
really measuring course achievement. Perhaps the students would 
have done as well on these items even without taking the course. 
To check this we gave the same test to à group of students at 
the beginning of the course. The results showed their scores to be 
significantly lower than those of the educational psychology 


students. i 
As Table I indicates, if we take the gain in test score from 
beginning to the end of the course as measure of learning, re- 


tention over a four to 8-month period is more than eighty per 


the help of Dr. Robert Dixon in arranging for 


ger 
? The authors appreciate z o 
Snoek assisted in 


administration of this test. Charles Smith and Diedrich 


test scoring and analysis. 
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cent. While the possibility remains that the test measures some 
general gain in sophistication independent of the course, it seems 
fairly safe to conclude that material learned in the first course 
in psychology is retained for an appreciable period after the final 


examination. 
REFERENCE 
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ASSESSMENT OF INTELLIGENCE LEVEL BY 
THE RORSCHACH TEST AS COMPARED 
WITH OBJECTIVE TESTS 


BOLESLAW A. WYSOCKI 


Marquette University? 


Rorschach wrote that his test “... presents an intelligence 
test almost completely independent of previous knowledge, 
memory, practice and degree of education” (2, p. 183). This high 
estimate of blots as to assessing intellectual abilities is shared, 
almost unopposed, by most of the Rorschach experts. They contend 
that intelligence is one of the most important factors that can be 
measured with a great accuracy by the Rorschach test. Thus the 
purpose of this paper will be to find out to what extent the group 
method of administration proves or disproves Rorschach's well- 
known contention concerning the assessment of intelligence level 
by his ink blots. 

"TESTS - 

The following tests were used ih the present study: (1) Group 
Rorschach, scored after Klopfer (2; 8, 4); (2) Verbal Intelligence 
Test, “S.P. Test 15," used by the British Army and consisting of 
one hundred equally weighted items; (3) Nonverbal Intelligence 
test, Raven's Matrices, consisting of sixty equally weighted 
items. : 

SUBJECTS 


In order to make comparisons with Rorschach categories, the 
verbal and nonverbal intelligence tests were given to one hundred 
and thirty-two men and eighty-five women. The ages ranged from 
eighteen to forty-six with an average of 27.9 years for men, and 

' from sixteen to forty-three with an average of 21.5 years for women. 
The average educational background was grade 10.3 and 9.6 for 
men and women respectively. 


1 Now at the Alliance College, Cambridge Springs, Pa. 
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TABLE L—CoRRELATION COEFFICIENTS BETWEEN RORSCHACH 
CATEGORIES AND INTELLIGENCE TESTS 


Verbal Intelligence Test Nonverbal Intelligence Test 
Rorschach 
Categories 

Men Women Men Women 
No. of R 0.440 0.389 0.389 0.450 
W% 0.239 0.331 0.329 0.112 
M% 0.272 0.368 0.266 0.389 
F +% 0.356 0.111 0.129 0.111 
F- 96 —0.115 —0.174 —0.083 —0.053 
A% —0.291 —0.205 —0,124 —0.184 

RESULTS 


In order to establish validity, if any, of the group Rorschach 
method in assessing intelligence level, the several Rorschach 
categories were correlated with the two intelligence tests.? In all 
cases coeffieients were calculated by the full product-moment 
formula (Pearson) and are shown in Table I. 

Correlations between the several Rorschach categories and the 
intelligence tests of 0.228 and 0.283 are significant at 0.01 level for 
men and women respectively; whereas correlations of 0.174 and 
0.217 are significant at 0.05 level for men and women respectively. 

As Table I shows the highest positive correlation with intelli- 
gence tests, significant at 0.01 level (both the verbal and nonverbal) 
is N of R, that is, total number of responses for men and women. 
The second category showing positive correlation at 0.01 level of 
significance with both tests is W, a complete whole response. Next 
to the W is the M category (human movement), showing positive 
correlations at 0.01 level of significance with both tests for mer 
and women. Good form F + %, gives only one positive coefficient 
significant at 0.01 level with the verbal intelligence test for the 
male group. The poor form, F — %, gives negative correlations 
not significant at 0.01 or 0.05 levels. Animal responses, A, di 
correlate significantly with the intelligence tests at 0.01 level for 
men and 0.05 level for women. 

Next, the possibility of combining the scores in the various 
Rorschach categories was taken into consideration to see if 8 


? Coefficient of correlation between the verbal and nonverbal intelligence 
tests was 0.707. 


l————— —————— ——— a 


VA. 
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TABLE IL—INTERCORRELATIONS OF INTELLIGENCE TESTS 
AND RORSCHACH CATEGORIES 


ue SR we | aes ren | ntm m 
N. verbal Int. Test | 0.707 | 0.424 | 0.273 | 0.203} 0.193|—0.052|—0.191 
Verbal Int. Test — | 0.409 | 0.290 | 0.179| 0.209|—0.146|—0.229 
No. of R — | 0.384 |—0.083|—0.054| —0.007|—0.222 
W% — .092| 0.064| 0.093|—0.196 
M% — 0.334|—0.140|—0.209 
F+% — |—0.503| 0.087 
F-% — 0.044 
A% — 


combination would be better than the best single score. For this 
purpose the Wherry-Doolittle test selection method was adopted 
(1, p. 435) to satisfy need. Thus, the matrix of intercorrelations 
between the two intelligence tests and six Rorschach possible 
indicators of intelligence was computed. The correlations are 
simple product-moment coefficients for the whole group of subjects, 
that is, for one hundred and thirty-two men and eighty-five women 
tested in the study. The correlation matrix is given in Table II. 
Correlations of 0.181 are significant at 0.01 level and correlations 
of 0.138 are significant at 0.05 level. 

Next, corrected multiple correlations were calculated as follows: 


Rv.1 =0.409 Rn.1 0.424 
(R of 0.181 is significant at 0.01 level) 
Rv.14 =0.464 Rn.13 =0.492 


(R of 0.212 is significant at 0.01 level) 
Rv.146  -0487 Rn.134 — =0.499 
(R of 0.234 is significant at 0.01 level) 


=0.487 Rn.1345 = =0.503 


Rv.1463 Saul 

(R of 0.253 is significant at 0.01 level) 
Rv.14632 =0.490 Rn.13456 —0.505 

(R of 0.269 is significant at 0.01 1 
Rv.146325 —0.487 Rn.184562 —0.505 "e: 


(R of 0.283 is significant at 0.01 level) 


In the above-listed multiple correlations 1 stands for No. of R; 
2 for W%; 3 for M%; 4 for F+ 96; 5 for F— 25; and 6 for AG, 
The symbol v represents the verbal intelligence test, and n dé 
nonverbal intelligence test. 

m the review of these multiple correlations one may observe 
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that by adding several Rorschach categories to the first one selected 
(with the highest correlations with intelligence tests) substantial 
gains are obtained increasing multiple R from 0.424 to 0.505 and 
from 0.409 to 0.490 for intelligence tests, both nonverbal and verbal, 
respectively. On the basis of the above multiple correlations it 
may be noted that, when taken together, four Rorschach cate- 
gories, namely number of responses, human movement, good form 
and animal responses, constitute the best combination of intelli- 
gence predictors, as measured by intelligence tests. 


SUMMARY AND CONCLUSIONS 


An attempt was made to determine the validity of the group 
Rorschach in assessing the level of intelligence. In the first stage 
of this investigation, the group Rorschach results were compared 
with two intelligence tests, verbal and nonverbal, for men and 
women separately. Significant correlations, ranging from 0.272 
to 0.450, were found for the following Rorschach categories: total 
number of responses, the percentage of whole responses, the 
percentage of human movement responses, good form per cent, 
and the percentage of animal responses. The correlations between 
the poor form level and intelligence tests were found not significant 
when calculated for men and women separately. 

Further, a consideration was given to the possibility of exploring 
combinations of several scores instead of single ones. Thus, after 
computing the correlation matrix for men and women together, 
several corrected multiple correlations were calculated. It appeare 
that by successive additions of single Rorschach scores the multiple 
correlation was significantly increased up to 0.081 R points. It also 
turned out that the following Rorschach categories constituted the 
best combination of scores for predicting level of intelligence: 
number of responses, human movement, good form, and animal 
responses. 


These results confirm those of Wishner (6) with regard to the 
individual Rorschach. 
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BOOK REVIEWS 


SaMuEL J. Beck. The Six Schizophrenias. New York: American 
Ortho-Psychiatric Association, 1954, pp. 237, plus viii. 


An important contribution to Rorschach researches is Dr. Beck's 
volume on the six schizophrenias. The study purports to be, and 
is, a coórdinated inquiry into the problem of schizophrenia as 
carried on by the three scientific disciplines—psychology, social 
work and psychiatry. This is the orthopsychiatric pattern for the 
multi-discipline approach in the study of human behavior. The 
researches, one with children and one with adults, were conducted 
in 1947-52 in the Michael Reese Hospital in Chicago. The aim 
of the research group has been to describe the schizophrenic 
person by operational trait data which are clinically and psycho- 
dynamically meaningful and to inspect these descriptions by 
sound method. 

Six patterns of dynamic organization of the traits emerged and 
the volume is chiefly concerned with a description of details of 
the study and the results found. 

Dr. Ray R. Grinker contributes the initial chapter called 
“Clinical Introduction” and in it he describes the step-by-step 
build up of the research and the clinical significance of it for re- 
search and therapy. Considered in terms of methods and ap- 
proaches, the research includes: (1) The analysis of a sizeable 
series of normal adults and children—sixty children, fifty adults; 
(2) an analysis of the clinical histories and the Rorschach tests 
of non-schizophrenics; (3) an analysis of clinical history and 
study of clinical formulations with Rorschach correlates. 

The Stephenson Q technique used for sorting of clinical data 
and the corresponding Rorschach data on each case is considered 
in a special chapter included concerning Q methodology by Prof. 
Stephenson. 

It is Dr. Grinker’s opinion that the researchers give us not only 
a better idea of dynamic process of schizophrenia, a better way of 
understanding the process of disintegration and attempts at re- 
integration, a better predictive of diagnostic view of schizophrenia, 
but also a much more satisfactory way of viewing the personality. 
Dr. Beck perceives the researches as having four implications: 
(1) Describes the schizophrenia as certain over-all dynamic ad- 
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justment patterns; (2) emphasizes latency in schizophrenia that 
is not necessarily psychosis; (3) bimodality; (4) postulate of 
least resistance. And he is happy about the fact that the test as 
a tool has withstood the reactions of the tough minded group of 
psychiatrists that helped in his research. 

There is no questioning the value of these researches for stimu- 
lating further research, as well as adding to our knowledge con- 
cerning the Rorschach as a tool in the studying of personality. 
What can be questioned is the practicality of the approach in 
terms of making the most useful contribution towards developing 
insights for understanding and treatment of schizophrenia. 

However, the frame of reference for the researches is under- 
standable enough considering Beck’s persistent interest in the 
Rorschach test. 

H. MELTZER 


Human Relations Research Foundation, St. Louis, Missouri 


Detos D. Wickens and Donatp R. Meyer. Psychology. New 
York: Dryden Press, 1955, pp. 541 plus vi. $5.25. 

Irwin A. Bera. Student's Workbook. New York: Dryden Press, 
1955, pp. 181 plus v. 

Harry P. Bannick. Teacher's Manual. New York: Dryden Press, 


1955, pp. 70. 


Another text in Psychology is this book by Wickens and Meyer. 
In it the authors endeavor to present the essential facts about 
behavior as they are now known and also a systematic method 
of explaining which they consider the most fruitful, rigorous and 
promising. As far as intellectual orientation is concerned, the 
chief emphasis in this book is on the importance of learning and 
the use of learning concepts in the analysis of complex forms of 
behavior. The early chapters of the book deal with learning 
concepts and are captioned "Learning Fixation" and "Utilization 
of Learning." These are followed with chapters dealing with basic 
concepts in areas of motivation and perception, social behavior 
and personality development, dealt with in the light of the basic 
learning concepts as presented in the earlier chapters. The last 
few chapters deal with physiological foundations of behavior and 
a very brief last chapter represents an attempt to sum up. Each 
chapter is followed by a list of references. The book contains an 
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index. The publishers have done an excellent job in printing and 
preparing a book of attractive appearance. 

This new psychology text includes the usual chapters, though 
somewhat rearranged. Diagrams and pictures indicate a fair 
amount of resourcefulness used in putting the volume together to 
make ideas considered clear. References are relatively of a more 
recent origin. Behavior is described by the authors as “responses 
made to stimuli acting on the individual at the particular moment. 
Because of the individual’s biological structure and the nature of 
his past experience, these stimuli will evoke certain psychological 
mechanisms—such as habits and motives, to mention only two— 
and certain kinds of responses will follow.” This approach, the 
authors say, does not deny conscious experience, it simply states it 
will not use conscious experience as the main source of explana- 
tion. In short, what they are saying is that they are dominantly 
behavioristic. 

Academic newcomers in the field are likely to be less critical 
of the general content and emphasis. People who have grown with 
the field are likely to miss certain experiments, illustrations and 
conclusions that they are used to seeing and thinking of as being 
fundamental in presentation of the subject of psychology. People 
who are psychoanalytically oriented will not find the content or 
emphasis in this book satisfying. Their treatment of “learning to 
learn,” therapists and clinicians will find lacking in an emphasis 
on need for unlearning. In the authors’ consideration of forgetting, 

they cover qualitative experiments in forgetting. Qualitative in 
the sense of learning and forgetting more meaningful life experi- 
ences is not mentioned by them. Homeostasis is used in a com- 
pletely uncritical manner. Persons who think that it is well for 
era uaa Something about historical foundations of lon 
i & some of the founders, will find the book disaP 
pet Thorndike's name, for example, i$ aly 
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end of the book are relatively well done, but the kind of material 
students expect to find in psychology books is treated in less 
humanized fashion than needs be. All in all they have done well 
what they consciously strived to do. And omissions and shifts 
of interest obviously are intentional. It is a safe book to use for a 
psychology text rather than the most advisable one. 

Student’s Workbook for use with Psychology by Wickens and 
Meyer is prepared by Irwin A. Berg. It includes a listing of con- 
cepts, terms and objective test exercises of each chapter. It also 
contains some demonstrations, exercises and formal statements 
for demonstrations and projects. 

Teacher’s Manual to accompany the Wickens and Meyer book 
is prepared by Harry P. Babrick of Ohio Wesleyan University. 
It includes a list of text questions and discussion questions for use 
by teachers. It also contains an annotated bibliography of recent 
literature relevant to the teaching of general psychology. 

H. MELTZER 


Human Relations Research Foundation, St. Louis, Missouri 


Miron Greensiarr, Ricnarp H. York, and EsruER LUCILE 
Brown. From Custodial to Therapeutic Patient Care in 
Mental Hospitals. New York: Russell Sage Foundation, 1955, 
$5.00, pp. 497. 


“From Custodial to Therapeutic Patient Care in Mental Hos- 
pitals” is a publication of the Russell Sage Foundation, which was 
established in 1907 “for the improvement of social and living 
conditions in the United States of America.” The psychiatrist on 
the team, Dr. Milton Greenblatt, writes the chapters on “Towards 
Therapeutic Community” and with the social worker, E. Brown, 
writes the conclusions. The psychologist, Richard York, writes 
Part II, which is concerned with experiments in ward patient care 
in Veterans Administration Hospitals, and Part III where the 
problem of motivating patients on chronic wards at Metropolitan 
State Hospital is considered. The book also contains an excellent 
bibliography on social treatment compiled by Dr. Frederick L. 
Wells. It also contains an index. 

That mental hospitals are behind times is no news for people 
in the field and for an increasing number of other people. The 
hospitals whose function it is to administer to the sick minds, 
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ly three quarters of a million 
patients, continue to struggle with buildings of which some are 


nor experience. The purpose of this volume is not to tell a weeping 
story of neglect for mental hospitals, but to present a description 
of work done and work that can be done to reduce the present 
confusion and discouragement. It is basically an exploratory re- 
view in social treatment, The attitude of the authors is eclectic 
in nature. As they see it, the function of a mental hospital is to 
utilize every form of treatment available for restoring patients to 
health. And at worst, to aid them to live as nearly normal lives 
as possible within an institutional setting. The goal of the authors 
is to use both the personnel and physical resources of the hospital 
to aid in the development of competence in the patients, The de- 
scription of ward and ward treatment included will be of interest 
to all people working with institutional cases or in institutions. It 
will be of particular interest and use to administrators of mental 
hospitals. In their assessments of hospitals, collaboration with 
community sources and resources, the authors show a nice balance 
between the sense of limitations 
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state governments, sights are being raised and at least the be- 
ginnings of promise are in view. The question facing us now, as 
described by the authors, is “whether hospitals will strive to 
maximize the use of such physical and social resources as exist, 
including resources in patients, to achieve this end; and whether 
the American public will become a more active partner in this 
venture by greatly increasing contributions and sustaining interest 
and service.” 

The description of the material here, as well as the attitude ex- 
pressed should be of help to all people connected with or working 


with mental patients. 
H. MELTZER 


Human Relations Research Foundation, St. Louis, Missouri 


C. WiuriELD Scorr and Crypg M. Hir. Public Education Under 
Criticism. New York: Prentice-Hall, 1954, $4.75, pp. 414 
plus xiv. 

Henry Enters. Crucial Issues in Education. New York: Henry 
Holt & Co., 1955, $2.25, pp. 277. 


To the wave of criticism of our public schools and education in 
general, there is only one answer—more and better education. 
And that includes education of educators and interested laymen 
to an awareness of the meaning and significance of the issues. By 
so doing, the editors of the two anthologies on education hope 
to contribute toward the rescue of America from the retreat from 
learning advocated by critics. The two volumes being reviewed 
are both concerned with the problem of how best to handle the 
criticism of education and how to develop realistic attitudes for 
action to improve the educational scene. 

“Public Education Under Criticism" represents an attempt on 
the part of professional educators to consider the needs and the 
best ways of handling the criticisms of teachers, school adminis- 
trators and teaching techniques. It makes an honest effort to let 
opponents and proponents present their particular attitudes for 
the purpose of appraising the credentials of our educational phi- 
losophy as well as practices. Selections made are presented under 
five sections, considered as areas of criticism and ways of handling 
them. The major criticisms include progressive education, funda- 
mentals, religion, social studies, teacher education and teachers 
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ANIMISTIC THINKING AMONG COLLEGE AND 
HIGH SCHOOL STUDENTS IN THE NEAR EAST 


WAYNE DENNIS 
Brooklyn College, Brooklyn, N. Y. 


INTRODUCTION 


In an earlier publication (4) the writer reported that approxi- 
mately one-third of the students in several American colleges and 
universities attribute life to one or more inanimate objects. This 
finding has been corroborated by several other investigators (1, 2, 
6, 9). 

In comparison to many countries, the level of education in gen- 
eral, and the level of scientific knowledge in particular, are high in 
America. If animistic thinking is present in one-third of American 
college students, it seems likely that it is even more prevalent in 
countries in which the educational levels are lower, and in which 
natural science concepts and information are less widely dissemi- `, 
nated than in the United States. The present paper reports data 
relative to this deduction, derived from college and high school 


students in the Near East. 


PROCEDURE 

Each subject was given a mimeographed sheet containing the 
following instructions: 

"Tn connection with each of the objects listed below, indicate 
whether it is living or not living. Give a reason for each answer. 
Please answer the questions in the way that you think they would 
be answered by a biologist. For each object, the question is, is the 
object living in the same way that animals and plants are living?” 

Below the instructions appeared the names of seven objects. For 
the subjects who were college students the objects were: a lighted 
match, the sun, the wind, the sea, lightning, a pearl and atoms, On 
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the blanks given to high school students, “a river" was used in- 
stead of “the sea" and "petroleum" instead of "atoms." In the 
case of college students, the questions were presente 
in English, which was their language of instruction. 
ing of the high school subjects w 
questions were presented and ans 


d and answered 
Since the teach- 
as in Arabic, in their case the 
wered in that language. 


SUBJECTS 


The college students Were enrolled in a woman's college and a 
university in Beirut, Lebanon. Although these institutions are lo- 
cated in Lebanon, their students come from many Near Eastern 
countries, In the university, only male students were used as sub- 
jects. The majority were students in the college of arts and sciences. 
High school data were obtained from three public high schools in 
Iraq. Two of these schools were located in Mosul and one in Bagh- 
dad. The latter school and one of the Mosul schools were day 
schools; the other was a night school attended primarily by adults 
who had discontinued their earlier education before the comple- 
tion of high school. 

In the colleges the animism questionnaire was given in the intro- 
ductory psychology course. Since the majority of students at these 
institutions take this course, the sampling is believed to be repre- 
sentative of students at these institutions, In the high schools, the 
questionnaires were administered to first-, second- and third-year 
students in required courses, and consequently, the results are be- 
lieved to be representative of these levels in these schools. 


RESULTS 
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TABLE I.—NUMBER AND Per Cent OF STUDENTS 
GIVING ÀÁNIMISTIC ANSWERS 


Mosul 
Gole | College | Baghdad | Mowat Day Night | ‘Tots 

Number of 130 75 115 286 141 747 

students 
Per cent giving 77 53 95 81 74 79 

one or more 

animistic 

answers 


and science courses for the two sexes are identical, indicate no sex 
differences. Comparisons of Christian and Moslem students, not 
included in Table I, indicate these two groups do not differ in fre- 
quency of animistic answers. 

Table II indicates the frequency with which each object was said 
to be living by the members of each group. It will be noted that the 
lowest percentage obtained was for lightning. Among the univer- 
sity men only sixteen per cent stated that it was living. 'The uni- 
versity men yield a response frequency of twenty-one per cent for 
both the burning match and the wind. The highest per cents ob- 
tained were seventy-five per cent and seventy-eight per cent. In 
both cases these refer to the pearl, and the scores were made by the 


TaAnLE IL—Pzn CENT or RESPONSES TO Each QUESTION 
WHICH WERE ANIMISTIC 


College College Baghdad | Mosul Day |Mosul Night 
Women Men Students Students Students 
—————— 
Match 47 21 50 27 42 
Sun ^ 38 26 66 36 35 
Wind `; 34 21 56 24 35 
Sea 41 24 
Lightning 30 16 32 23 25 
Pearl 26 24 78 75 42 
Atoms 55 31 
River 54 28 42 
Petroleum 49 23 40 
Per cent of total 39 23 55 34 37 
answers animistic 
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TABLE III —RzasoNs Given ron ANIMISTIC ANSWERS 


College Women Baghdad Students 
Per cent Per cent 
Moves, changes, does something 39 38 
Supports or produces life 21 7 
Has a beginning and an end 14 4 
Grows 16 23 
Uses oxygen, produces energy 6 16 
Miscellaneous 4 12 


Iraqi high school groups. It will be noted that in four of the five 
groups one-third or more of the total responses were animistic. 
In the case of two groups, the college women and the Baghdad 
students, the reasons given for positive answers have been ex- 
amined and classified. The results are shown in Table III. It will be 


thirty-nine per cent and 


Soning is not incorrect; only the facts ane 
K The statem 
living becaus 
education do 
Apparently the students 
and they reasoned that 
College and 
which are seld 
American Colle 
l ge students 
1s that the Sun, or the sea 


Y children. The answers occur among 
(4) as well as in the present sample, One 
» OF some other object is living because it 
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produces, or is necessary for, life. This answer seems to result from 
the premise that like produces like, and therefore, that whatever is 
responsible for, or necessary for life, must itself be living. The sec- 
ond notion is that since life has a beginning and an end, all things 
which have a beginning and an end, as do a flame and a flash of 
lightning, are therefore living. These answers have not been re- 
ported by Piaget in his four stages of animism, presumably be- 
cause he questioned only children. They represent categories or 
“stages” in excess of his four. 


DISCUSSION 


It is obvious from the present study, as well as from earlier ones 
(3, 4), that Piaget’s assumption that animism is limited to children 
is incorrect. It seems probable that a degree of animistic thinking 
can be found in some adults in any society. The actual percentage 
of animistic replies will differ from question to question and from 
group to group. 

While the cultural area in which the present study was conducted 
was a center of one of the earliest civilizations, extensive modern 
educational programs have been introduced only recently. Science 
education, as compared to linguistic, literary and historical in- 
struction, is undoubtedly still weak. These two facts probably ac- 
count in the main for the high incidence of animistic replies to our 
questions. 

We do not believe that the prevalence of animistic thinking in 
our subjects is due to positive cultural influences. Christianity and 
to which our subjects belong, both reject the view that ob- 
jects of nature and forces of nature are themselves alive. Both of 
these religions would reject such ideas as tending toward idolatry. 

We have indicated earlier (8) our belief that animism arises 
autogenously in individuals who lack an adequate background in 
modern biological concepts and knowledge. The persistence of ani- 
mism into the adult years, in the Near East, is to be accounted for 
of educational handicaps rather than in terms of a posi- 


Islam, 


in terms 1 
tive animistic tradition. 
SUMMARY 


Seven hundred and forty-seven Near Eastern college and high 


School students were questioned in regard to whether each of sev- 
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eral objects is living or not living. Seventy-nine 
or more animistic answers. In the various 
of animistic answers were much hi 
American students of comparable 
ferences are presumed to reflect th 
tific concepts and information in 


per cent gave one 
groups the frequencies 
gher than those obtained among 
educational placement. The dif- 
e differences in diffusion of scien- 
these populations. 
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EFFECTS OF PRAISE AND REPROOF ON 
READING GROWTH IN A NON- 
LABORATORY CLASSROOM 
SETTING 


HARRY F. SILBERMAN 
Division of Teacher Education 


Municipal Colleges of the City of New York 


This report compares teachers whose verbal behavior is charac- 
terized by different amounts and combinations of praise and re- 
proof. Praise and reproof are verbal incentives that have been 
found to enhance performance on learning tasks (1, 3, 8, 9). In 
some experiments praise has been superior to reproof in its effect 
upon performance (1). In other experiments, reproof has been 
shown to be more effective (2). Three sources of variation which 
might account for the conflicting results obtained in these studies 
are: (1) school or teacher differences, (2) motivational differences 
among pupils engendered by novel laboratory settings, and (3) 
differences in operational definitions of praise and reproof. 

The present study, using forty-nine teachers in grades three to 
six in nineteen schools, is designed to study the effects of praise 
and reproof in teachers’ verbal behavior on reading growth of 
pupils in non-laboratory classroom settings. It was the intention 
in this study to investigate not only praise and reproof but the 
combination of the two. The combination of praise and reproof 
is obtained by multiplying the praise scores by the reproof scores. 
A convenient way of interpreting this product variable is provided 


by the following table: 


PRAISE 


REPROOF 
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This table shows four possible combinations of praise and re- 
proof in teachers’ verbal behavior. If the PR variable, (praise-re- 
proof combination), is significantly related to the reading growth 
criterion, teachers in the “a” quadrant differ from teachers in the 
"d" quadrant in promoting reading growth provided other rele- 
vant factors have been controlled. 

Two additional variables, teachers’ verbal output as measured 
by the number of statements teachers make and the relative 
amount of time spent on reading skills, were also chosen for study. 

The hypothesis of this study, stated in the null form, is that 
measured reading growth is unrelated to a weighted combination 
of the relative amounts of praise, reproof, combined praise by re- 
proof, verbal output, and the observed amount of time spent on 
reading skills. If this hypothesis is rejected, the alternatives are 
that either praise, reproof, praise by reproof, verbal output, time 

on reading skills, or some combination of these five independent 
variables, is related to measured reading growth of pupils. 


PROCEDURE 
Sample. This study was condu 
55 using twenty-three third-grade classes 


municipal colleges of New York 
Queens, D 


City: City, Hunter, Brooklyn and 
n about the sixteen hundred and 


year. Each teache sat only one 
obs : T Was visited by 
Mis en js e An observational schedule and rating form, de- 
Settee Fda "5 à previous report, (5) was developed 
tion schedule with Ses observers, The dimensions in the observa- 


The“ ; a mphasis." 
€Xpressive beha dimension, adapted from Withall 


= ee 
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(11), consists of a group of seven spaces on an observation card 
for tallying teachers’ verbal behavior and certain non-verbal be- 
haviors. The half-hour visits were divided into six five-minute ob- 
servation periods. During the second, fourth, and sixth five-minute 
observation period the observers tallied every remark the teacher 
made in one of the following categories of the “expressive behav- 
iors” dimension. 


(1) Supportive statements. Teachers’ remarks which trained observers 
construe as approval of the pupils or of their responses are tallied in this 
portion of the card. For example, comments such as “Right,” “Good,” “Fine 
work,” which inform the pupil that he or his responses adequately satisfy 
the standards which the teacher expects the pupil to meet, are classified as 


praise. 
(2) Problem-structuring statements. Statements in which the pupil is 
offered facts, ideas, opinions, or questions in an objective non-threatening 


manner. 
(3) Neutral statements. Statements not classified in any other category. 
(4) Directive statements. Statements that limit the pupils’ choice of be- 
haviors. Examples are: “Answer the questions on page 12” and “Pass the 


papers forward.” 

(5) Reproving statements. Tallies of hostile, disapproving verbal be- 
havior which the observers construe as threatening to the pupil. “You're all 
wrong” and “Won’t you ever learn?” are illustrations of reproving statements. 


During the first series of classroom visits the six observers made 
a new tally on this dimension only when the teacher’s remarks 
shifted from one category to another. During the second series of 
classroom visits the six observers tallied the frequency of occur- 
rence of statements in each category. 

The “subject-matter” dimension of the observation schedule is 
a group of ten categories for tallying the subject areas in which 
the activity of the class is centered. One tally was made in this 
section by each observer during every five-minute portion of the 
half-hour observation period. The subject areas are: reading, 
mathematics, language arts, social studies, science, recreation, arts 
and crafts, music, social processes and test periods. 

The first series of observations was arranged during February 
and March of 1955. Teachers were assured on several occasions 
that the observer was simply recording what went on in the class- 
rooms and was in no way judging or rating the teachers. They 
were told of the purely scientific interest of the investigators and 
were repeatedly assured that all information obtained was entirely 
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confidential. The teachers were notified in advance of the date and 
approximate time of each visit, and sometime near the appointed 
time an observer unobtrusively entered the classroom, taking a 
seat at the rear of the room. The observer proceeded to observe 
and place tallies on the observation schedule card reserved for that 
teacher, employing a stop-watch to time the five-minute portions 
of the half-hour visit. The second series of visits was arranged 
during March and April, 1955. 

The criterion (dependent variable). A detailed discussion of the 
development of the reading growth criterion is presented in another 
report (7) in the current series of research studies on teacher be- 
havior being performed by the Division of Teacher Education of 
the College of the City of New York (10). Reading ability of the 
pupils in all forty-nine classes was measured during November 
1954 and again during April or May of 1955 using the California 
Reading Test (Elementary): Word Form, Word Recognition, 
Meaning of Similarities, and Interpretation of Meanings. Form S 
of the California Test of Mental Maturity, Non-Language Section, 
was also administered during November, 1954. The reading growth 
indices to be used as a criterion in this study consist of the means of 
the final reading scores of the pupils in the teachers’ classes, ad- 
justed by covariance control for the mental maturity scores and the 
initial reading scores. This procedure effectively equates pupils 
with respect to mental maturity and previous achievement in read- 
ing. In the analysis of the data these reading indices were converted 
to normalized “T” scores, 

Composition of the inde 


pendent variables. This study entailed 
the use of six variables, 


l consisting of one dependent variable 
described above and five independent variables. 

' a five independent variables are operationally defined as 
ollows: 
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This is an estimate of the relative amount of classroom verbal behavior ex- 
hibited by each teacher. 

(5) Reading Time. Total number of tallies for each teacher in the reading 
section of the “subject-matter” dimension of the observation card. This con- 
stitutes an estimate of the relative emphasis or amount of time that each 
teacher devoted to specific instruction in reading. 


Preliminary analysis of the five independent variables indicated 
that the 49 teachers did not rank differently on the two series of 
observations on all but the reproof scales. As a result, the number 
of tallies on the two series were added for each teacher on all but 
the reproof scales. The reproof measures obtained on each of the 
two series of observations were given equal weight by converting 
the measures on each series to standard scores and then multiply- 
ing the derived scores by the standard deviation of the series with 
the largest variance. The obtained scores were then combined to 
form the reproof score for each teacher. 

In the analysis of the data all variates were converted to normal- 
ized “T” scores. Differences among teachers were significant at 
the 0.01 level on each of the six variables. The reliabilities of these 
variables when estimated by analysis of variance (2, 7), are shown 
in Table I. 

RESULTS AND DISCUSSION 

The intercorrelations among the independent variables in this 
study are shown in Table II. Both praise and reproof are sig- 
nificantly related to total verbal output. 

The relationships between the reading growth scores and the 
independent variables is shown in Table III. The multiple correla- 
tion is not significant at the 0.05 level; hence, this sample does not 
contradict the hypothesis that in the population there is no correla- 
tion between reading growth and a weighted combination of the 
five independent variables. In other words, the independent vari- 
ables do not predict the reading growth of pupils in this sample. 


I.—EsrrwATED RELIABILITIES OF EXPERIMENTAL VARIABLES 


TABLE 
Reading growth 0.87 
Praise 0.71 
Reproof 0.84 
Verbal Output 0.80 
Reading Time 0.64 
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TABLE II.—INTERCORRELATIONS or Five INDEPENDENT VARIABLES 


Praise | Reproof p ise by yaa Benang E 
Praise —.1605 | +.6370** | --.6057** --.1839 
Reproof +.6274** | --.4304** | — 0036 
Praise by reproof +.7973** | +.2050 
Verbal output +.2745 


Reading time 


** Significant at the 0.01 level. 


TABLE IIL—RrLATIONSHIPS BETWEEN READING GROWTH SCORES AND 
Five INDEPENDENT VARIABLES 


Partial Regression 
Independent Variables Zero-Order Cocfücients Standard Errors 


Correlation (Beta Weights) 


Praise +.0258 +.2806 5657 
Reproof —.1539 +.1125 . 5597 
Praise by reproof —.1282 —.4522 -7075 
Verbal Output —.0519 +.1432 .2574 
Reading Time — .1951 —.1929 .1616 


Multiple Correlation Ry,12315 = .2795. 


In view of the non-significant relationships between the criterion 
and the independent variables employed in this study, no definite 
conclusions can be drawn. Whether or not an increase in the num- 
ber of teachers observed, or other precautions designed to enhance 
the Precision of the experiment, would have produced different 
findings is a matter of conjecture. Perhaps the observed teacher 
une Were atypical due to the presence of the observers in 
ihe cec If such were the case, the biased observations w ould 
of ete independent variables that are not representative 
d sihi a the negative results would hardly be rape 
was the inabit should be noted in this study 


who issues large amounts of re- 
hat she is the victim of an emo- 
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produce a more permissive climate and more growth in reading. 
'These are problems for further research. 

The results of this study indicate that if there is a relationship 
between the verbal incentives and learning to read, it is a low re- 
lationship. If we accept this interpretation, certain implications 
for further exploration are evident. Many educators extoll praise 
and mild reproof as classroom incentives and offer a carte blanche 
endorsement of both as if praise and reproof contained some in- 
herent merit. Yet, praise and reproof, as defined in this investiga- 
tion, did not have appreciable effects on reading growth. Perhaps 
classroom situational variables not observed in this study are 
much more important to pupil growth than praise and reproof 
per se, The nature of the situations in which the incentives were 
employed and the nature of the pupils receiving the incentives 
may have been much more important variables to observe. At any 
rate, in this study praising and reproving remarks were tallied 
whether or not the incentives were contingent upon the correctness 
of pupils reading responses. The teachers verbal behavior during 
the teaching of the criterion skill as well as during the teaching 
of other skills was used. 

Granting these limitations, the present study does not justify a 
blank endorsement for the use of praise or reproof per se in the 
classroom, and points up the need for the verification of labora- 
tory experiments by observational studies of normal classroom 


behavior. 
SUMMARY 


Forty-nine beginning teachers were each visited twelve times 
during the 1954-55 school year and their verbal behavior was 
categorized and tallied. Pupils were given reading tests at the be- 
ginning and end of the school year. 

A multiple correlation was computed between reading growth 
and five independent variables, namely, praise, reproof, praise by 
reproof, verbal output, and time devoted to reading skills. Results 
of the analysis did not yield significant relationships between the 
independent variables and reading growth of pupils. 

A discussion of the limitations of the study and of possibilities 
for further research on this problem was presented. 
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EFFECT OF STEREOTYPED ATTITUDES ON 
LEARNING 


ROBERT E. EGNER and ALVAN J. OBELSKY 


One of the most recent and promising of the new developments 
in educational testing has been the work done in measuring stereo- 
typed conditioning in students. Students arriving in college are 
hardly the "tabula rasa," attitudinally speaking, that college 
educators seem to have long taken for granted. Each has been 
subject to a unique schooling of attitude conditioning in the 
family, church, neighborhood, and in pre-college education. Some 
have relatively fixed and naive opinions on topies about which 
they are usually not yet competent to express an opinion; others 
are more flexible and sophisticated and "liberal" in their attitudes. 
It is this sort of psychological trait that stereotypy testing is 
designed to measure. 

The development of the stereotypy syndrome and its measure- 
ment are of quite recent origin. The monumental study, The 
Authoritarian Personality, was the first large-scale and system- 
atic attempt to evaluate factors concerned with stereotyped con- 


ditioning, particularly where it concerned racial discrimination. 


This work, both because of the importance of its subject content 
and its methodological analysis, stimulated further work in other 
phases of stereotyped conditioning, most noticeably as it applied 
to political attitudes? Its implication for the educational field, 
however, had not been missed and within the past five years a 
considerable amount of work, primarily in the development of 
valid tests of stereotypy, has been done. It is interesting to note 
that the first, public presentation dealing with the relevance of 


iT. W. Adorno, et al, The Authoritarian Personality (Harper, 1950). 
Credit, of course should be given also to the earlier studies relating to 
the theory and measure of personality syndromes conducted by Henry 
Murray and his associates at the Harvard Psychological Clinic. H. Murray, 
et al, Explorations in Personality, (Oxford, 1938). 

? See especially: Studies in the Scope and Method of the “Authoritarian 
Personality,” edited by Chrisite and Jahoda (Harper, 1950). This book 
provides a critique of The Authoritarian Personality and, in addition, de- 
velops some interesting modifications and extensions of ideas contained in 
the earlier work. 
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stereotypy to the learning process dates only from 1952, while 
the earliest devised test for measuring stereotypy in students, 
that of the American Council on Education, appeared in 1951. 
Since that time “Inventory of Beliefs” tests, essentially adapta- 
tions of the American Council on Education test, have been used 
at a number of institutions, notably at the University of Chicago. 
During the past three years at Northland College the Form T 
test (used at the University of Chicago) has been given to students 
in a required junior-year philosophy course by Professor R. E. 
Egner, and the combined results analyzed and significant cor- 
relations examined. The results of these efforts will be briefly 
described in the following paragraphs. 

The Form T test used in this study is composed of an inventory 
of one hundred beliefs, of which sixty are common stereotyped 
expressions. The examinee is asked to indicate quickly his ac- 
ceptance or rejection of the statements. No way is provided for 
indicating a neutral position and all items must be answered. 
The test is designed to indicate only the direction in which the 
personality has been conditioned to move. The stereotyped per- 
sonality will show a marked tendency to accept the stereotyped 
beliefs and to reject the non-stereotyped statements; with the 
non-stereotyped personality, the two tendencies are reversed. A 
sample of the beliefs taken from the Form T test are shown be- 
low together with the appropriate key. 


(A) I strongly agree or accept the statement. 
(B) I tend to agree or accept the statement. 
(C) I strongly disagree or reject the statement. 
(D) I tend to disagree or reject the statement. 


(1) Literature should not question the basic moral concepts 
of society, š 


can find a job if he really wants to work. 


ill eventually explain the origin of life- 
A origin hat 
(4) No task is too great or too difficult when we know t 

God is on our side. 

~ *8, Goldie ral 

e re E "The Authoritarian Personality and jo m 

deos x the 60th Annual Convention of the Ameri d 

A recent and ; September, 1952, Washington, D. C. A Lalani 

more compli study dealing with problems of asi A 

; may be found j 10 of Methods 0. 

Personality Assessment by Stern, Stein uz prse rm Glencoe, 
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(5) A sexual pervert is an insult to humanity and should be 
punished severely. 

(6) The twentieth century has not had leaders with the vision 
and capacity of the founders of this country. 

(7) Other countries don’t appreciate as much as they should 
all the help that America has given them. 

(8) No censorship on the presumed morality of books and 
movies can be justified. 

(9) Our rising divorce rate is a sign that we should return 
to the values which our grandparents held. 

(10) Most intellectuals would be lost if they had to make a 
living in the realistic world of business. 

This test was administered over a period of three years to a 
total of two hundred and eighteen students enrolled in an intro- 
ductory philosophy course (junior level) which was required of 
all students at the college. The samples upon which the subsequent 


analysis is based represent students from this total who scored 


at the extremes on the test and who were also equated for intel- 
ing the 


ligence.* Twenty low and twenty high scorers, composi 
two samples, were selected for further investigation. The problem 
was to determine if any significant correlation existed between 
high and low scoring on the stereotypy test and performance in 
the various subject areas. If one interprets the significance of 
stereotyped conditioning on learning ability in terms of the num- 
ber of successful students in each sample the differences in per- 
formance between the two groups readily becomes apparent. The 
figures in parentheses indicate the number of students in each 
sample receiving à grade of “C” or above in the various areas 
of learning. 
Humanities Stereotype (12) Non-Stereotype (19) 
Social Science Stereotype (11) Non-Stereotype (17) 
Biological Science Stereotype (18) Non-Stereotype (17) 
Mathematics Stereotype (17) Non-Stereotype (12) 
Natural Science Stereotype (16) Non-Stereotype (12) 
. It is clear that the non-stereotypes performed appreciably better 
in the Humanities and in the Social Science subjects, while in 
Mathematics and in the Natural Sciences the performance of the 


‘In both samples students were selected whose performance on the 
A.C.E. Psychological test were approximately the same—individual scores 
were located within a range of eight centiles. 
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stereotypes was superior. In the Biological Sciences both groups 
did about equally well. In view of the observation that a higher 
percentage of stereotypes tends to drop out in the first two years 
of college it is quite possible that a selective element is present 
in the sampling. Any adjustment for this, of course, would im- 
prove the showing of the non-stereotype group. — 

The above results, in spite of the small sampling taken, are 
closely in line with the observations made from similar studies 
at other institutions. The student most likely to succeed in courses 
which involve relatively high levels of flexibility of outlook is the 
non-stereotype. As one moves from the concrete to the abstract 
a larger percentage of stereotyped students begins to manifest 
evidence of frustration, culminating in more frequent academic 
failure, once their threshold of tolerance has been exceeded, Fig- 
ures would most likely underestimate their rate of failure since a 
substantially larger number of stereotypes withdraw from college 
before eventual failure. According to the University of Chicago 
examiner, Dr. Benjamin S. Bloom, about one fourth of the stereo- 
typed students drop out of the College of General Education by 
the end of their first quarter. The reasons given for these with- 
drawals provide a clue to the kind of atmosphere in which the 
stereotype thrives. Some of the most common follow: The educa- 
tion was not very practical; assignments were never very definite; 
the discussion technique represented a poor educational method 
(apparently because it lacked an authoritarian teacher) ; definite 
and final answers to problems in the humanities and the social 
Sciences were never given. 

An educator who is reasonably alert to the performances of 
his individual students will have little difficulty in recognizin& 
the presence of stereotyped attitudes and opinions among n 
Students. Evidence ean be found in abundance in written examina- 
tions, partieularly in the essay type where the student enjoys 
more or less free reign in putting down his thoughts. one a 
Sponses to questions and private discussions with the instructo 
—__. 


» ups are 
_ In Humanities and Social Science the differences beween rs ie ds 
Significant at the 5% level: in Mathematics the difference is just be 
tical significance, i North 
? From an address delive; i pd 
í red at the Workshop in Higher Edt i 
Central Association Stud: i iae iversity of Chicago. 
ee ee Y of Liberal Arts Education. Un: 
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indeed, any form of communieation—will reflect these attitudes. 
As long as the stereotype is communicative, and it is probably the 
case that as a group they are more inclined to be so, his char- 
acteristics can be readily observed. The “true” stereotype is 
entirely convinced, at least on an emotional plane, of the cor- 
rectness of his views and so does not hesitate to express them. 
This is true of the “left” and “right” stereotypes; both insist on 
the correctness of their respective positions which are located at 
the extremes of the continuum line. To take an example from the 
economics class, a “left” stereotyped student will tend to view 
the capitalist system as one characterized by a highly unequal 
distribution of income and unequal economic opportunities for 
its members, and exploitation of the working classes by those in 
economie power. Those on the “right” consider the institution of 
private property a sacred mainstay of freedom, both economic 
and political, with inequalities a necessary driving foree in the 
economy, and view the influence of labor as baleful to freedom 
and restrictive of the natural working of the economie system. 
It would not be difficult to find comparable examples of "left" 
and “right” stereotypy in the other academic disciplines. 

This “black” and “white” mentality of the stereotyped student 
can be so easily recognized (at least by an instructor who does 
not share the same species of stereotyped attitude) that a ques- 
tion very quickly arises concerning the usefulness of an elaborate 
test to detect its presence. The test clearly is of greatest use 
where large numbers of students are involved whose attitudes are 
as yet unknown. Under these circumstances it is convenient and, 
at the same time, it permits a rough quantitative appraisal of 
the degree of stereotypy present, At a large institution it could 
be given as part of a battery of tests and, if desired, correlations 
among performances on the stereotypy and other tests could be 
readily found. In the smaller school, where classes are generally 
limited and contact between students and instructor more in- 
timate, the value of the test is considerably diminished. 
Bo et eae 

le-8 c . One might think that 
this is a task for the psychological advisor, yet changing fixed 
and inadequate beliefs of students is a constant function of educa- 
tion. The extreme stereotype who holds to irrational beliefs with 
an obsessive force might, perhaps, be put down as a case in 
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pathology. But the more common stereotype presents a problem 
for the educator which differs only in degree from that which is 
faced in the ordinary affairs of education. Viewed in this light, 
the stereotype becomes simply a more intractable case to handle, 
though still amenable to the usual techniques of effective teach- 
ing. “Bare” facts, served in imposing dosages, cannot be ignored 
by the student for very long, and if the patience of the instructor 
is not lacking it should be a matter of time before a strongly 
entrenched stereotyped attitude begins to break down. Removal 
of the student from the emotional environm 
stereotyped conditioning would, of course, c 
to the progress, 

The entire problem of stereotypy has such broad implications 
for society as a whole that it seems superfluous to attempt to 
offer further justification for our concern with this subject. Studies 
of the authoritarian personality, which deal with a rather specific 
form of stereotypy, give adequate evidence that the fate of our 
democratic institutions themselves is at stake in this issue, The 
work now being done on stereotypy and the learning process 
should convince educators that they are in a more potent posi- 


tion to advance the cause of democratic society than might have 
appeared, 


ent fostering the 
ontribute a great deal 


LOGICAL VERSUS EMPIRICAL SCORING 
KEYS: THE CASE OF THE MTAI 


N. L. GAGE 
Bureau of Educational Research 


College of Education, University of Illinois 


The Minnesota Teacher Attitude Inventory (MTAI) (1), has 
scoring weights which are non-monotonically related to degree of 
agreement with the item. This is true of ninety of the MTAI's 
one hundred fifty items. That is, the same scoring weight is as- 
signed to responses on both sides of the *undecided" position, or 
different scoring weights are assigned to responses on the same side 
of the “undecided” position. Such items can be identified wherever 
the 0 weight falls somewhere other than between the +1 or —1 
weights, or wherever two or more +1 or —1 weights are in non- 
adjacent alternatives. (Non-monotonic weights would not be so 
puzzling if they proceeded +1, —1, —1, —1, +1, or —1, +1, +1, 
+1, —1. There are, however, no items like this in the MTAI.) 

For example, consider Item 1, “Most children are obedient.” 
The scoring weights for this item are SA, +1; A, 1; U, 0; D, —1; 
SD, 0. It is difficult to make psychological sense out of such scor- 
ing weights. The general direction or trend of the weights for all 
items does indeed assign positive weights to a permissive, child- 
centered orientation. But the non-monotonic weights defy any 
such conceptualization. And they imply that dimensions other 
than degree of agreement with the items are relevant to the test’s 
purpose. 

Two defenses might be offered for these non-monotonic weights: 
(a) they tend to defeat attempts at faking; (b) the scoring weights 
are based on empirically determined differences between the re- 
sponses of one hundred very good and one hundred very poor 
teachers. 

Both these defenses can be pierced by available evidence. Thus 
the MTAI has indeed been found to be fakable* (4), despite the 
ae and apparently illogical scoring weights of many of the 
items. Second, experience has shown (3, p. 450) that very large 


* At least in one sense. 
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prising if the MTAI’s non-monotonic weights 


e that the non-monotonic, empiri- 
cally derived scoring weights have lower validity than a logical 
set of scoring weights. To test this hypothesis, two “logical” scor- 
ing keys were made: (a) a dichotomous-logical key in which both 
choices on a given side of “undecided” were weighted +1, and the 
remaining three choices were weighted 0; (b) a trichotomous- 
logical key, in which the same choices as in a were weighted +1, 
but the two choices on the other side of “undecided” were weighted 
—1. In each instance we weighted --1 that side of the response 
continuum which contained the +1 weight in the published key. 
The following item illustrates the three keys whose validities could 
be compared: 


1. Most children are obedient. SA A U D SD 
Published key +1 -1 0 -1 0 
Dichotomous-logical key +1 +1 0 0 
Trichotomous-logical key Th di qo ay = 


rating scale. 
alf reliabilities of the scores 
keys were higher (0.93 and 0.94) 
btained with the empirical key (0.90). Hen¢® 
We do not need to depend on the empirical key to measure some- 
a consistent over items. — 
: cient of the tric -logical key 7 
$52 thus that of the published eerie es as oem 
Drs mj t that the logical key had a "e Wer 
ner alidi teworth that it did not have lo 
validity, y that it 


Table I shows that the split-h 
obtained with the two “logical” 


These findings have p ; ; importance, 
oth practical heoretical imp 
Tn future use of the MTAT, the Published bi aya key should be 
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TABLE I.—RELIABILITY AND VALIDITY OF THREE TYPES OF 
MTAI Score 
N = 97 


r with 
M S.D. id Criterion 


Criterion (Mean rating by pupils on| 20.7 | 10.4 | .91* 
“My Teacher") 


Published empirical key for MTAI 50.7 5 |.90** .26 
Dichotomous-logical key for MTAI 11.8| 17.1 | .93** 23 
Trichotomous-logical key for MTAI 88.2 9 |.94** 28 


* Horst (4). 
** Corrected split-half. 


reévaulated in comparison with our “logical” key; it seems prob- 
able that the empirical validity of the MTAI can be improved 
by using a logical scoring key. 

Of even greater potential significance is the implication that 
what the MTAI measures can be conceptualized in terms of rela- 
tively simple and communicable theory. If we can abandon the 
complex and arbitrary-seeming published scoring weights for the 
items, the scoring key becomes readily comprehensible and pre- 
dictable. Thus, we have found that three graduate students in 
educational psychology—instructed to assign scoring weights in 
accordance with a permissive, child-centered, non-authoritarian, 
and emotionally secure orientation—were able to predict the logical 


scoring weights unerringly. 
SUMMARY 

The Minnesota Teacher Attitude Inventory had slightly higher 
reliability and validity when scored with a logical key than when 
scored with the published empirically derived scoring weights, 
which are non-monotonic for ninety of the one hundred fifty items, 
Practically, this suggests reéxamination of the published key. 
Theoretically, this means that the MTAI’s meaning becomes 


relatively easy to formulate. 
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STABILITY OF INTRA-INDIVIDUAL 
PATTERNING OF MEASURES OF 
ADJUSTMENT DURING 
ADOLESCENCE 


FRED T. TYLER 


University of California 


INTRODUCTION 


According to certain theoretical positions, as indicated in vari- 
ous textbooks, there is a high degree of constancy and consistency 
in much of the individual’s behavior. Empirical evidence on this 
point is relatively scarce since problems related to the behavior 
of the individual over a period of time have received much less 
attention than have those dealing with the behavior gf groups of 
individuals at some specific point of time. Yet understanding and 
prediction of an individual's behavior depend upon knowledge con- 
cerning stability and consistency of his behavior: stability in the 
behavior of a group of subjects is no guarantee of stability in each 
member of the group. 

The importance of questions dealing with the stability of per- 
sonality is apparent: prediction is feasible if behavior is consistent 
over a period of time. According to Olson, prediction is possible 
when growth is continuous: "The fact that there is continuity to 
the growth process makes it possible to project curves and to pre- 
dict” (6, p. 36). However, the present writer believes that predic- 
tion depends more upon stability and consistency than upon con- 
tinuity of growth, since, in our opinion, growth may be continuous 
even while being irregular and erratic. Olson appears to accept the 
concept of stability of personality: “The stability of personality 
results in a strong resistance to displacement by temporary en- 
vironmental variations” (6, p. 276). 


ANALYSIS OF THE INDIVIDUAL 


Analyses of physical and behavioral data of the single case have 
been, in the past, primarily graphical in nature, although other 
procedures are beginning to appear. Kerlinger (5) employed an 
analysis of variance technique to investigate the validity of the 
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principle of unity of growth, or the concept of “homogeneous” and 
“heterogeneous growers,” to use his terms. However, the “sub- 
tractive” scores he used in his analysis do not seem appropriate 
since consideration of his methodology shows that the use of “sub- 
tractive” scores may indicate the presence of heterogeneity of 
growth when in fact the growth ages imply homogeneity; and vice 
versa. Tyler (9) illustrated correlational [P-technique (1)] and 
factorial analyses of physical data for the individual subject. The 
effects of “staggering” longitudinal data to take into account the 
fact that growth in different physical characteristics is sequential 
rather than synchronous in nature are also under investigation by 
means of correlational analyses. Analysis of variance of a variety 
of behavioral data expressed in standard scores has also been used 
to investigate the validity of the principle of unity of growth. 
Spearman’s rho and Kendall's W also appear to offer possibili- 
ties for the statistical analysis of data from the individual case. 
The present writer recently reported, for a series of behavioral 
measures, the values of rho between the measures obtained at two 
ages in the adolescent period for each of sixteen boys.? These rhos 
were 0.74, 0.71, 0.67, 0.63, 0.60, 0.54, 0.52, 0.46, 0.44, 0.42, 0.33, 0.09, 
0.08, and —0.42, indicating a tendency for some 
patterning of the scores on the tw 


but not for others; the degree of agreement between measures 
having reference to two occasions varies from one individual to 
another, and even the largest rhos imply some considerable disa- 
greement among the rankings of the characteristics at different 
ages, 

The concept of unit 
Seem to allow for 
degree to which su 
However, it seem 
the extent to whi 
constant during 

The data to b 


similarity in the 
o occasions for some subjects, 


y among measures of the individual does not 
the possibility of individual differences iP the 
ch measures are idiomatic within the individual. 
S certain that there are individual differences 19 
ch the individual's pattern of adjustment remains 
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TABLE I.—RELIABILITY COEFFICIENTS FOR CATEGORIES OF 


ADJUSTMENT 
Test-R. à = iabili 
— Eus | euin 
Occasion Occasion 
Name 
4 1 2 3 
Social maladjustment 74 71 .81 var 
Personal inferiority .68 62 .60 45 
Overstatement ate 64 .58 52 
Family maladjustment .71 .44 .40 .80 
Physical symptoms .75 74 .68 72 
Fears .70 .69 .72 91 
Generalized tensions .76 .78 .73 .76 
School maladjustments .67 .88 .92 91 


ment. Their designations and reliability coefficients are shown in 


TableI (7, 8). 

Our questions may be formulated as follows: 

(1) What is the degree of agreement within an individual among 
seriatim measures of adjustment obtained during adolescence? 

(2) Is the agreement significant for some subjects but not for 
others? 

(3) Is the agreement su 
accurate predictions from ear 

(4) Is the degree of agreem' 
eral adjustment? 


fficiently large to permit reasonably 
lier to later ages of the individual? 
ent related to the individual's gen- 


THE ANALYSIS 


The nature of the questions to be considered and the method 
of analysis to be employed will be described by means of the data 
in Tables II and III; the measures are standard scores (based upon 
grade groups) for each variable at each chronological age. We may 
determine the extent to which each category of adjustment occu- 
pies the same rank among the other categories at each of several 
ages. Measures of the relationships between the ranks at different 
ages may be computed by means of Spearman's rank-order 
method, as for the data in Tables II and III. 

The values of rho for the three possible pairings of ages are 0.55, 


? Case 134 in the California Adolescent Growth Study. 
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TABLE II.—Sranparp SCORES ON THE CATEGORIES OF THE U. C. 
Inventory I BY CHRONOLOGICAL AGE 


(Case 134) 
Category of Adjustment 
CA 
1 2 3 4 5 6 7 8 
1 51 20 50 36 33 72 46 45 
13 76 61 57 54 45 63 70 66 
17 71 44 49 48 43 72 54 48 


CA 1 2 3 4 5 G 4 3 
m 2 8 3 | 6 A 4 | 5 
13 1 5 6 7 8 2 3 
17 2 7 € ee) 8 | d 3 | 5.8 
Sum 5 20 13 18.5 23 6 9 13.5 


0.96, and 0.67, implying a tendency for the variables to occupy 
similar ranks at different ages. [This type of coefficient of corre- 
lation is termed 0-technique by Cattell (1)]. 

The number of rhos to be computed increases rapidly as the 
number of ages for which data are available increases, so that the 
amount of computation increases rapidly with an increase in the 
number of rankings. In any case, a more general question, leading 
to a more general answer, and involving a limited amount of "e 
culation, might be asked: Is there agreement among the 4? 


assigned to each category of adjustment for the three ages °” 
sidered? 


Kendall (4) pro 
able techni 


; in which 


th 
computation of W, are ey 


indicated. 
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The coefficient of concordance is defined as follows (2): 


W= Sum of squares between columns 
Total sum of squares 


(5? + 20° --- 9 + 13.5°) _ 3 X 8(9)° 


The between vari EM EONTR 
ance 3 i 
= 102.17 
3 3 
Th * _ m(n—n  3(8 —8) 
e total variance i i2 126 


(where m is the number of rows and n the number of columns). 
Then, for the data of Table III: 


The “between” variance = 102.17 
and the “total” variance = 126 


102.17 _ 
so that W = 398r =0.81 


The significance of W may be determined by means of the x? 
test: 


» (n — 1) (sum of squares between columns) 
gi (n — n)/12 


7 X 102.17 
= = 17.0 
(8 — 8)/12 3, 


where there are (n-1) degrees of freedom. 

For our example, x? = 17.03, which just fails to reach signifi- 
cance at the 0.01 level. We conclude that there is a high probability 
that the agreement among the patterns of adjustment at different 
ages is not due to chance. 

It might also be of interest to know the mean value of all the 
rank-order coefficients of correlation in our m X m table. This 
mean value may be determined from the following formula (2): 


4. mW-1 3X081-1 
sU 2 


= 0.72 


In our example, the value of 0.72 is almost identical to the mean 
value of the three rank-order coefficients already reported for 
Table III. 
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TABLE IV.—DisTRIBUTION OF VALUES oF W 


Interval Number of W's 


.80-.89 
.70-.79 
-60-.69 
-50-.59 
.40-.49 
.30-.39 
.20-.29 
.10-.19 


2eu0coooos 


Total 


es 
© 


Similar computations were made for thirty boys, all members 
of the California Adolescent Growth Study. For twenty-eight boys, 
eight categories of adjustment were available for ranking on each 
of seven annual occasions (i.e., chronological ages); measures 
were known for each of six occasions for the other two subjects. 
The value of W for significance at the 0.01 level is 0.38 for the 
7 X 8 Table, and .44 for the 6 x 8 Table (W was corrected for 
continuity and for tied ranks, although these corrections resulted 
in but minor changes in the value of W). 

The distribution of the values of W is shown in Table IV. About 
50 per cent of the W's are significant at the .01 level. We conclude 
that there are individual differences in the presence of significant 


R 
TABLE V.—DisrRIBUTION OF VALUES OF THE MEAN or RANK-ORPP 


CORRELATIONS 
Interval No. of Mean Rhos 
.80-.89 1 
-70-.79 1 
-60-.69 2 
.50-.59 3 
-40-.49 2 
-80-.39 6 
-20-.29 5 
.10-.19 1 
-00-.09 3 
rS NR 
"Total 
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agreement among the rankings of the categories of adjustment 
being considered. 

The mean values of all rank-order coefficients of correlation for 
each subject were also computed according to the formula already 
indicated. The distribution of mean rhos is shown in Table V. 
Again, it is obvious that there are individual differences in the 
degree of agreement among the profiles at different chronological 
ages. Further, it appears that the prediction of later from earlier 
profiles must be accompanied by the possibility of large errors. 
The patterning of the categories of adjustment measured by the 
U. C. Inventory I does not confirm a hypothesis of unity of 
growth or of homogeneity of growth for these subjects. 


DISCUSSION 


It may be that the non-significant values of W are the results 
of changes in the rank orders of the categories in a profile that 
arose from the unreliability of the measures. If this is so, we might 
expect that the differences between the standard scores for suc- 
cessive ranks in each profile will be less in the cases of those sub- 
jects with the non-significant W’s than of the other subjects. We 
determined the distribution of these differences for each of twenty 
subjects, ten with significant W's and ten with non-significant 
W's. The median value for each distribution was then computed; 
these are shown in Table VI. 

It is seen that the medians are very similar for the two groups; 
that is, the differences between the standard scores for consecutive 
ranks on any occasion may be as small in the case of the significant 
as of the non-significant W's. The absence of significant agreement 
among the categories does not appear to be determined solely by 
unreliability of the measures of adjustment. Some might argue, 
however, that the presence of several non-significant values of W 


Taste VI.—Mep1an VALUES or DIFFERENCES BETWEEN STANDARD 
Scores REPRESENTED BY CONSECUTIVE RANKINGS FOR Ten SUBJECTS 


WITH SIGNIFICANT W’s AND TEN WITH Non-Sraniricant W’s 


Group Median Values 


With significant W’s 3 
With non-significant — 3. 
W's 


.0 4.0 4.1 4.5 5.0 7.0 
.1 4.3 4.3 4.5 4.6 5.2 
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may be a reflection of unreliability of the measures of adjustment 
(see Table I). 

We may now consider the fourth question that was proposed 
earlier, namely, is general adjustment related to consistency of 
adjustment over a period of years? The subjects of the California 
Adolescent Growth Study were rated by each of three psycholo- 
gists at the end of the school period on each of five categories of 
adjustment, known as “manifest traits,” as follows: 

(1) Social prestige 

(2) Popularity with same sex 

(3) Work adjustments 

(4) Heterosexual adjustments 

(5) Personality adjustment 

The subjects were rated on a seven-point scale, the final rating 
being the average of the ratings by the three psychologists. Rat- 
ings on these five “manifest traits” were available for twenty of 
our thirty subjects, nine having non-significant W’s and eleven 
having significant W’s. The ratings of each subject on each trait 
are shown in Figure I; the mean ratings are also indicated for each 
group. Successful adjustment as rated on the "manifest traits" 
does not appear to be related to significant agreement among the 


8 C D E 


on "manifest traits”: those with nonsignificant W's 


———— wv — Á 
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categories of adjustment measured by the U.C. Inventory I during 
the adolescent years. Stability in the patterning of the individual’s 
self-concept is not related to the psychologists’ concept of the sub- 
ject 's adjustment as evaluated by them at the end of the school 
period. (The rank-order correlations between the value of W and 
the ratings on each of the five “manifest traits” for the twenty 
subjects were 0.07, 0.16, 0.09, 0.16, and —0.03.) 


SUMMARY 


The patterns of adjustment, as revealed by a self-report inven- 
tory administered annually to the subjects of the California Ado- 
lescent Growth Study, were analyzed by means of Kendall’s W 
to determine the degree of agreement in the patterning of measures 
on eight categories of adjustment on each of seven annual occa- 
sions, The valuc of W was found to be significant for one half of 
the subjects. The presence of non-significant W’s in some subjects 
did not seem to arise simply from the unreliability of the measur- 
ing instruments since the median differences between the standard 
scores of consecutive ranks within each profile were about the 
same for the subjects with significant W’s as for those with non- 
significant W’s. 

The presence of stabi 
self-concept on the eight c 
to successful adjustment as judged by 
end of the school period. 

It is believed that the na 
able to the hypothesis of a 


lity in the patterning of the individual’s 
ategories of adj ustment was not related 
three psychologists at the 


ture of the results reported is not favor- 
high degree of stability in the adoles- 
cent’s reported self-concept. There are individual differences in 
the degree of stability; in all cases, prediction of later from earlier 
measures of adjustment would result in some considerable degree 


of error. 
There remains, of course, the problem of inferring the nature 


of the individual’s behavioral characteristics from a knowledge of 
the changes in the scores on a self-report inventory. It may be ar- 
gued that the individual’s pattern of adjustment is, in reality, more 
stable than the profiles imply. Changes in the patterns may reflect 
changes in expressions of adjustment and maladjustment. An ado- 
lescent’s relationships within his family may become ne unbea 

able that he comes to the point of indicating good adjustment in 
the family and at the same time changes from indications of good 
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to poor adjustment in school. Or, again, he may indicate a change 
from unsatisfactory to satisfactory adjustment in school, but at 
the same time his "generalized tensions” may become much more 
prominent in his self-report inventory. 

The data analyzed and reported here indicate that the pattern 
of the adolescent’s social and emotional adjustment, as measured 
by the U.C. Inventory I, may be quite erratic during a seven-year 


period. A different type of analysis may provide evidence relative 
to the problem just discussed. 
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PUPIL GROWTH IN READING—AN INDEX 
OF EFFECTIVE TEACHING 


HAROLD E. MITZEL and DONALD M. MEDLEY 


Division of Teacher Education, Municipal Colleges of New York City 


The fact that much of the recent sharp criticism of modern pub- 
lic education has centered on the teaching of reading reflects the 
importance that the general public attaches to this phase of in- 
struction. The volume of study and experimentation devoted to 
methods and materials of teaching reading indicates that educa- 
tors, too, attach great importance to this part of the school cur- 
rieulum. One important phase of the problem seems to have been 
neglected, however—the róle of the teacher in the teaching of 
reading. A recent review of quantitative studies attempting to 
evaluate teacher effectiveness reveals few that employ reading 
growth of pupils as à criterion of effectiveness (5). The general 
conclusion to be drawn from these few studies is that little or noth- 
ing is known about what behavioral and personality characteris- 
tics distinguish the effective teacher of reading from the ineffective 
one. 

Any attempt to id 
the development of a 
of the teacher on the job 
lated. This report descri 


entify such characteristics must begin with 
reliable criterion measure of the effectiveness 
to which measures of behavior can be re- 
bes the development of such a criterion. 
The chief difficulty that arises when a measure of a teacher’s 
effectiveness is sought in the growth scores of her pupils is the 
difficulty in separating the teacher’s contribution from all of the 
many other factors which affect the scores. Unless a study is re- 
stricted to the teachers in a single school, thereby losing most of 
its generalizability, different teachers will teach in different schools 
located in different communities or neighborhoods. And, if ele- 
mentary-school teachers are being studied, no two teachers will 
teach the same group of pupils. If the mean growth of one teacher’s 
class in one school is greater than that of another teacher’s class 
in another school, it could be argued that this difference results 
from differences in schools or neighborhoods, and from differences 
in pupils, and that perhaps none of the difference can be attributed 
to a difference in teachers. Suppose one teacher has a high-ability 
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group in a new, well-equipped building in a middle-class neighbor- 
hood; another has a “slow” section in a superannuated building 
in a slum area. Can these teachers be compared on the basis of 
their pupils’ progress? 


sible to test the latter against the f. 
school differences may be ignored; if they are greater, comparisons 
among teachers should be made only within schools, 

Controlling class differences. The best way of dealing with class 
differences among pupils is to assign pupils in the same school to 
, then it is safe to con- 
different classes in ex- 


Unfortunately, 
ment of pupils to 


there are two important sources of variation are 
ch are of primary concern: past achievement and Ram 
arning. It was decided that these two variables ven 
i covariance analysis. Each pup! 
m reading and in mental maturity at the beginning © 


tude for Je 
be control] 
Was tested 
the school 
A note 


a Je u 
dicated by the cd Teading ability possessed da from the 


of th is inistered to a 
n den e pupils Scores, A reading test admini 


ity (in comparison with other 
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school career up to that time. Similarly, a pupil’s score on a reading 
test at the beginning of the year indicates how much he has ac- 
quired before coming to his present class. His score on an intelli- 
gence test administered at the same time as the initial reading 
test is a partial measure of his aptitude for learning. The covari- 
ance technique predicts, on the basis of performances of all the 
pupils at the beginning of the year, what score each pupil may be 
expected to achieve at the end of the year, and then compares the 
score he actually obtains with that score. It is the pupil whose 
final score is farthest above his own expected score whom this 
method regards as having the highest score, instead of the pupil 
farthest above the group mean. 

In other words, the effect of application of covariance adjust- 
ments is to start all pupils off as though they had all had the same 
score on the initial reading test and on the intelligence or mental 
maturity test. 

Other factors. It should be recognized that there may be other 
differences among these classes (average daily attendance, for in- 
stance) that are not measured by the initial tests. Such variables 
have not been controlled in this study, and remain to contaminate 
teacher differences, masking to some extent the real differences 
among teachers. It is necessary to assume that the effect of such 
variables is negligible.! Since, so far as could be ascertained, these 
variables were not used as a basis for assigning pupils to classes, 
this assumption is not unreasonable. 


COLLECTION OF DATA 


Sample. In the fall of 1954, a follow-up study was made to locate 
those of the one thousand six hundred twenty-eight teachers in the 
student teaching class of 1953-54 from the municipal colleges who 
were teaching either as regular teachers or permanent substitutes 
in New York City public elementary schools. Each teacher who 
was assigned to teach in grade three, four, five, or six in a school in 
which there was at least one other such teacher was asked to par- 
ticipate in the part of the project under discussion, " 

Of the seventy-five teachers who met these criteri i 
were eliminated either because the schools were web n 

e . : . , E 
cause the principle or one of the teachers in a school did not wish 


1For an analysis of the implications of such i 
i assumpti i 
means if they are not fulfilled, see Lindquist (3). DR mud. ED 
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to codperate, or because one of them failed to accept a proffered 
teaching position. Participation in the study was voluntary, but 
partly because of the rather heavy pressure exerted by district 
superintendents and principals on each teacher to participate, the 
number of teachers omitted at their own request was small—not 
greater than three or four. Two principals failed to coóperate; one 
was the victim of a sudden increase in school population and felt 
that he had so many problems as a result that he was justified in 
dropping out; the other maintained that the additional burden on 
a beginning teacher was too great to justify her even being asked 
to coóperate. 

Of the fifty-six teachers whose classes were tested in the fall, 
seven were dropped from the study during the year. Whenever one 
teacher in a school with two candidates in it withdrew, the other 
teacher was perforce also lost. Table I shows the distribution of the 
sample by schools and grades. 


TABLE I.—GnapE LEVELS TAUGHT BY 49 TEACHERS IN 19 Scuoors 


Grade Level Taught 
School No. Total 
III IV V VI 

T 1 1 2 

2 2 2 4 

3 1 1 2 

4 1 1 i! 3 

5 2 3 

6 2 2 4 

7 1 1 1 3 

8 1 1 1 3 

9 2 2 
10 2 2 
11 1 1 2 
12 2 2 
13 2 
14 à 2 i 2 
15 1 1 2 
16 1 2 
17 1 3 1 3 
18 4 1 5 
19 1 : 2 

| 

Totals P uw | a — Po emm 49 
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Test data were incomplete at the beginning of the analysis in 
four classes, so that the covariance analysis to follow was based on 
forty-five teachers. Three of these forty-five teachers were men. 
Except for these selective factors, and the requirement that there 
be two or more new teachers in each school, the sample appears to 
be representative of beginning teachers in New York City public 
elementary schools. 

Tests used. To measure the reading ability of pupils the authors 
selected four sub-tests of the California Reading Test (Elemen- 
tary)—Word Form, Word Recognition, Meaning of Similarities, 
and Interpretation of Meanings. These four sub-tests have a total 
of eighty-seven test items. Three sub-tests entitled Meaning of 
Opposites, Following Directions, and Reference Skills, were omit- 
ted in order to shorten the administration time. The California 
test was chosen as the reading measure for the investigation for 
the following reasons: 

(a) It seems to measure aspec 

with the instructional objectives 0 


school system. 
(b) It offers coverage of a wide range of reading skills with 


norms given for grade equivalent scores from 2.0 to 9.8. 


(c) Equivalent forms are available. 
(d) The format does not require complicated marking behav- 


iors on the part of the pupils being tested. 

To estimate the learning capacity of the pupils in this study 
the 1950 Short Form of the California Test of Mental Maturity, 
Non-Language Section was used. The non-language section, con- 
sisting of four sub-tests totaling sixty items, was chosen for the 
following reasons: 

(a) It makes small demand on pupils’ reading ability. 

(b) It is convenient to administer during a single school period. 

(c) It has a range of mental age scores from 5.0 to 20.0. 

(d) The format does not require complicated marking behav- 

iors on the part of the pupils being tested. 

The manner in which the data were collected implies the fol- 
lowing major assumptions: 

(a) That the four portions of the California Reading Test pro- 

vide adequate coverage of the reading objectives of New 
York City teachers. 
(b) That the pupils in each class who were present at both ini- 


ts of reading ability consistent 
f the New York City 


232 The Journal of Educational Psychology 


tial and final test administrations are representative of the 

whole class taught by each teacher. 
These assumptions are made 
tions underlying the st 


tered to the remainder, F; 
Maturity, Non- 


test, given after a lapse of 
tered by the examiner wh 
of the initial testing was 


ANALYSIS OF RESULTS 


Differences in final reading score means. An analysis of variance 
of the final rea ing scores (that is n 


TABLE TL—Anatysig OF VARIANCE oF Finan READING SconEs 


OF 826 Purius IN 45 CLASSES 
Source D.F. Mean Square F 
Witt lasses 44 2361.598 21.71** 
asses 781 108.788 
Total a eee cs See 


** Signin, 
Cant at the 0.01 level. 
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TABLE III.—ANaLYsis or VARIANCE OF FINAL READING SCORES 
or 826 PUPILS IN 45 CLASSES WITH INITIAL READING SCORES 
HELD CONSTANT 


Source D.F. Mean Square F 
Between classes 44 376.384 7.54** 
Within classes 780 49.934 

Total 824 


** Significant at the 0.01 level. 


points, while the difference between two pupils in different classes 
is, on the average, about forty-nine points. 

The fact that the average difference between pupils in different 
classes is more than four times as great as that between pupils in 
the same class suggests that there may be significant differences 
among classes. That there are significant differences is indicated 
by the F-ratio of 21.7, which has a probability of less than 0.01. 
"These differences are probably due in large part to the fact that 
the classes are not all at the same grade level. One would expect 
most sixth graders to read better than most third graders. Whether 
or not these differences may be attributed in part to differences 
in the quality of the teaching exhibited in the different classes 
cannot be inferred from this analysis. 

Mean differences adjusted for initial reading scores. When the 
mean squares are “adjusted” to allow for differences in initial 
scores—when, that is, the departure of each pupil's score from the 
score predicted for him on the basis of his initial score is used in- 
stead of its departure from the mean of all the final scores—the 
mean square between classes is reduced from two thousand three 
hundred sixty-two to three hundred seventy-six and the mean 
square within classes from one hundred nine to fifty. These results 
are shown in Table III.2 

The average difference between pupils in the same class is still 
seven points; indicating that even when initial status is allowed 
for, there are considerable differences in the effectiveness of a 
teacher with different pupils. The average difference between pu- 
pils in different classes ig now about nineteen points. This means 
that, even if all of the pupils in all of the classes had started out 


—— 2 
? Computational procedures used are given in detail by Johnson (2). 
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TABLE IV.—ANALYSIS OF VARIANCE OF Finan READING SCORES OF 
826 PUPILS IN 45 CLASSES WITH INrTIAL READING AND MENTAL 
Maturity Scores Hetp Constant 


Source D.F. Mean Square F 
Between classes 44 386.400 T.9/** 
Within classes 719 48.496 

Total 823 


** Significant at the 0.01 level. 


with the same reading score, the difference between pupils with 
different teachers would have been greater than that between pu- 
pils with the same teacher. The F-ratio of 7.54, significant at the 
one per cent level, justifies this conclusion. 

Adjustment for mental maturity. It is still doubtful whether 
these differences represent differences in teachers’ skill, since the 
average learning ability (in the sense of mental maturity) of 
classes might differ. A second “adjustment” was therefore made for 
mental maturity, with the results shown in Table IV. 

Here the average difference between classes increases slightly 
while the average difference within classes decreases slightly, in- 
creasing the F-ratio to 7.97. Most of the differences among pupils 
in mental maturity seem to have been reflected in differences in 
initial reading score, and their effect was removed in the first ad- 
justment. 

At this stage it seems justifiable to conclude that there are dif- 
ferences in average effectiveness among these forty-five teachers. 
Allowing for differences among pupils with respect to initial read- 
Ing scores and mental maturity scores, there remains an average 
difference among pupils with the same teacher in final reading 
Scores of 6.9 points, while the average difference between pupils 
With different teachers is 19.7. A given teacher is not equally ef- 
fective with all pupils, but some teachers are more effective on the 
average than others, 
ustimation of reading growth indices. The next step in the anal- 
8 was the estimation of the adjusted mean final reading test 
Score of each class—the reading growth index.? Similar indices 
Were computed for the four classes not used in the analysis. For 


ysi 


? The method of estimation used is that described by Cochran and Cox (1). 


=" 
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comparison, mean final reading test scores adjusted for initial 
reading scores only, and unadjusted mean final reading scores were 
also computed. 

Table V shows the class mean final reading scores, unadjusted, 
adjusted for initial score only, and adjusted for both initial score 
and mental maturity score. The grade levels and class sizes are 
also shown. The first, third and fifth columns show the ranks of 
the classes on each of the three sets of means. 

Effects of covariance adjustments on rank order. The rank order 
of the classes changes considerably with the first adjustment. The 
class which ranks nineteenth in unadjusted final scores ranks third 
When adjusted for initial scores; the second highest score moves to 
the thirty-first place when adjusted. The power of the covariance 
method is now apparent. In addition to eliminating much varia- 
tion not related to teacher effectiveness, reducing the average in- 
terelass difference from forty-nine to nineteen points, covariance 
adjustment also rearranges the rank order of the classes. When the 
Initial scores are allowed for, the class which has the second- 
"en final score is seen to have shown less improvement over 
nitial standing than thirty other classes. 

The ranks roe relates little with the second adjustment— 
that for mental maturity—although the class originally second 
Moves down another three places to the thirty-fourth rank when 
Mental maturity is allowed for. This third set of ranks represents 

1e nearest approximation to the correct rank order of these forty- 
ae teachers in ability to promote reading growth that can be in- 
erred from à: 

Study of 21 — and Grade Differences. Two more 
factors which may affect the differences among classes were 
Studied—school differences and grade differences. Table VI shows 
te variance of reading growth indices 


results of an analysis of ; 
© Components between and within schools. The average differ- 


ence between classes in different schools is 5.5 points while the 
Average difference between classes in the same school is 48; al- 
-10ugh the former is somewhat larger than the latter, the difference 
18 not Significant. It may, therefore, be concluded that differences 
“Mong schools in the amount of pupils’ reading growth were not 
detected in this study. . ] 

, Table VIT shows an analysis of variance of reading growth in- 

ces into components between and within grades. Pupils in dif- 


TABLE V.—CoMPARISON OF UNADJUSTED, SINGLY ADJUSTED AND 
DOUBLY ADJUSTED FINAL READING Scorp Means or 49 
ELEMENTARY SCHOOL CLASSES 


i i Final Reading Score 
Final Reading Score) Adjusted for initial i 
Score and Mental ride Number 
Maturity Eel of 


Fu easing Score Adjusted for Initial 


Score 
Pupils 


Class and Rank Mean Rank Mean Rank Mean 


ed 


1 74.12 2 | 58.56 2 | 58.40 5 25 
2 69.43 | 31 | 47.71| 34 | 47.24 6 21 
3 65.52 15 51.10 16 | 51.14 5 21 
4 65.36 6 | 56.79 6 | 56.86 5 14 
5 65.14 1 60.43 1 60.43 4 7 
6 64.60| 18 | 50.67] 17 | 50.80 4 15 
7 63.75 | 24 | 49.32] 26 | 49.00 4 28 
8 63.19 5 | 56.88 7 | 56.69 5 16 
9 58.70| 17 | 50.95] 20 | 50.50 5 20 

10 57.57 | 10 | 54.05] 12 | 53.57 3 21 

1 57.50 8 | 55.92 8 | 56.00 3 24 

12 56.65| 21 | 50.12] 25 |49.29 6 17 

13 56.40 | 19 | 50.40] 19 | 50.53 4 15 

14 56.14 | 20 | 50.29) 23 | 49.48 6 21 

15 54.77 4 | 57.23 3 | 58.08 6 13 

16 54.07 | 30 | 47.93] 29 | 47.78] 4 27 

17 53.72| 9 |54.41| 10 | 54.45] 3 29 

18 52.95 | 22 |50.05| 21 | 50.05 4 19 

19 52.75| 3 |68.08|] 4 | 57.71 5 24 

20 52.03 | 32 | 47.48| 32 | 47.34 3 29 

21 51.66} 11 |53.91| 11 |53.97| 3 z 

22 50.96 7 |56.24 5 | 57.04 5 z 

23 50.21 | 14 |52.38| 14 | 52.42 3 : 

24 49.92 | 28 |48.31| 31 | 47.62 5 "i 

25 48.29 | 29 | 47.94] 27 | 48.12 3 

26 47.67| 13 | 53.42] 13 | 53.08 4 s 

27 45.56 | 16 | 51.06] 18 | 50.75 3 a 

28 44.13 | 27 | 48.50] 28 | 47.96 4 T 

29 43.06 | 12 | 53.50 9 | 54.75 3 i: 
30 43.00| 34 | 47.32| 344 | 47.05] 3 
31 42.05| 33 | 47.40| 344 | 47.05] 3 4 
32 41.33 | 39 | 44.67] 42 | 44.00) 4 18 
33 40.94 | 25 | 49.22] 22 | 49.50 3 ad 
34 40.86 | 26 |49.00| 24 | 49.43 3 49 
35 39.68 23 49.89 15 51.37 3 
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TABLE V.—Continued 


Final Readi, Final Reading Si Final Reading Score 
"Una ans Score Adjusted for initial | Agiusted forniual | eer 
core Maturity Pavel po us 
E 
Class and Rank | Mean Rank | Mean | Rank | Mean 
36 39.50! 36 | 46.86] 36 | 46.36] 5 14 
37 39.29] 46 | 41.59] 47 | 41.47] 4 17 
38 37.59 | 35 ļ|47.18| 30 |47.76| 3 17 
39 35.50 | 45 |42.09| 46 |41.82| 4 22 
40 35.11] 40 |44.47| 40 |44.79| 4 19 
4 35.09] 41 | 44.26] 41 |465| 3 x 
42 33.38] 42 | 44.00] 43 | 43.88] 4 2 
2 31.13] 44 |42.13) 45 |4225| 3 k 
44 30.1) | 43 | 43.22] 38 |4544| 3 S 
2n 30.00 | 37 |44.s8| 37 |45.62| 3 
2 
a 20.80! 38 | 44.82) 39 |45:1| 3 2 
F4 27.42) 47 | 41.424 44 | 42.40) 3 = 
P^ 20.85| 49 |40.08| 48 |438] 3 : 
28.38 | 48 |40.50| 49 | 41-00) $ | ——. 
Weighteg mae 7 48.75 | Total | 904 
rand Mo, | 2:75 48.75 
Range E 19.43 


50.74 20.35 


: hich was 
dor he Mean of 41.49 was calculated from a total, a portion of whic 
regression methods. a 

f oints. 
What Srades show a difference on the hap ieee = that 
betwen Or not this difference is significantly E t because the 
Patio Pupils in the same class remains Jo 

R i has a Probability between 0.01 and 0.05. iance analysis 
is ,CUabilig : step in the covar 
8 to est; tY Estimate, The final step 


i indices. The re- 
bacs the reliability of the reading growth indie 
1CE-ÀDJUSTED CLASS 


Tw: 
"ANALYSIS or VARIANCE OF A yrEFBRENCES 
Means vo Tast ror SCHOOL a 


So Square 
‘ urce D.F. Mean ue cc 
Sw, 
e 
Wit ^ 


ith; 2 Sch 30.461 
it oo]; 
| ER 
Total =n 
ES SS 


Taarn VI 
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TABLE VII.—ANALYSIS OF VARIANCE OF 49 TwicE-ADJUSTED 
Crass MEANS TO Test ror GRADE EFFECTS 


Source D.F. Mean Square p 
Between grade levels 3 71.884 $.18* 
Within grade levels 45 22.578 
— 
Total 48 


* Significant at the 0.05 level. 


liability of a test in any group is the ratio of the variance of the 
true scores to the variance of the obtained scores of that group. 
In this instance, the variance of the obtained scores is estimate? 
by the mean square between classes in Table IV. The error vari- 
ance is estimated by the mean square within classes. If the error 
variance is subtracted from the obtained score variance an esti- 


mate of the true score variance is obtained; the reliability maY 
therefore be estimated to bo: 


, — 386.400 — 48.496 
386.400 


This coefficient may be interpreted as representing the expected 
correlation between the mean reading growth indices of the forty- 
five classes studied and the mean reading growth indices tha 
would be obtained by forty-five other classes taught by the same 
teachers. Its magnitude indicates that had these teachers bee? 
assigned to other, similar classes, their rank order would not hav® 
differed greatly from that shown in Table M. 


= 0.874 


SUMMARY AND DISCUSSION 


_ A follow-up of one thousand six hundred twenty-eight students 
in New York City municipal colleges located seventy-five of the™ 
beginning their teaching careers in grades three, four, five, Or ~ 
in City public elementary schools which were employing at wel 
two such teachers, Of these teachers, nineteen were eliminated an 

the classes of the other fifty-six were tested with a reading test 
and a mental maturity test in the fall of 1954 and forty-nine O 

them were tested with equivalent forms of the reading test the 
following spring. 


: : T imate 
The analysis of variance and covariance was used to estimat 
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the average effectiveness of each teacher in stimulating her pupils 
to learn to read, taking into account: 

(a) Differences in learning aptitude among pupils. 

(b) Differences in previous reading achievements among pupils. 

(c) Differences in grade level. 

(d) Differences in average amounts of improvement in reading 

ability among pupils in different schools. 

The classes of these forty-nine beginning teachers were found 
to differ widely in the average amount of improvement in reading 
ability over one year, even when allowance was made for differ- 
ences in aptitude and amounts learned in previous years. These 
differences were no greater between classes in different schools 
than between classes in the same school. In general, there was 
Some evidence that beginning teachers’ effectiveness in the area of 
reading varies for different grades, but the evidence was not con- 
clusive, The reliability of the differences in average reading im- 
Drovement among classes was such that if the study had been a 
Peated with the same teachers in other classes, the mean growth 
Scores of the classes would have correlated 0.87. — 

It was therefore concluded that there are substantial neers 
“mong beginning teachers in New York City public schools d ese 
fectiveness in stimulating pupils to learn to read, and that nt 
differences cannot be attributed wholly to Dicen es 
Schools nor to differences in pupils' learning ability or pre 


Achievement. 


REFERENCES 


Wil - G. Cochran, and G. M. Cox, Experimental Designs, P- 
"Rod New York 
ud - O. Johnson, Statistical Methods in Research, pp. 246-206, New York, 
‘ce-Hall, 1949 j ; hol d 
ir : is of E ents in Psychology an 
E.F. Lindquist, Design and Analysis qu ipn: to 


78, New York, 


" ducatis 
n, pp. 323-325 and 328-330, New : 4 1 of Rupert 
a ing ; : i cy,” Journa 

menta J. Ling; "The Prediction of Teaching Efficiency: 


! Educati " . 
8 H.E, pd E p A Critical Review of the — gem 
dit “pil Growth Criteria in Studies of Teacher Effectiveness, 

dre 31, Division of Teacher Education, Boar 
ity of New York, April 1956. 


A FURTHER INVESTIGATION OF THE 
RELATIONSHIP BETWEEN ANXIETY 
AND CLASSROOM EXAMINATION 
PERFORMANCE 


ALLEN D. CALVIN, F. J. McGUIGAN, and MAURICE W. SULLIVAN 


Hollins College 


In a recent paper McKeachie, Pollie, and Speisman (3) found 
that students who were given the opportunity to write comments 
about their objective examination questions performed better OD 
the last half of their examination than students who were no 
given an opportunity to comment. They pointed out that this 
finding “... fitted in with our theory that tension is built UP 
throughout the test and that giving opportunity to comment Te- 
duces the increasing tension." (8, p. 95). They also reported a r°- 
lationship between examination scores and McClelland’s “nee 
for achievement” (2) personality variable. On the basis of their 
results they concluded: "Anxiety inhibits performance. GivinÉ 
students an oportunity to write comments aids not only in ae 
ducing the threat but also in channeling the release of anxiety 
(3, p. 98). 

Since McKeachie et al., themselves, noted after their first €** 
periment that “our results seemed too good to be true”, an attemP ` 
to replicate their findings seems called for. In addition, the fact 
that they had no direct measure of anxiety indicates the need for ap 
experiment where such a measure is provided. The following study 
is therefore an attempt to replicate McKeachie et al.’s results 
and to test their “anxiety reduction” hypothesis. 


METHOD 


Subjects. The Ss, one hundred and fifty-two undergraduate “a 
male students from Hollins College, were taken from five classe 
as follows: sixty-one and thirty-five from two introductory pS” 
chology classes, seventeen and twenty-one from two introductory 
Spanish classes, and eighteen from a Spanish literature class. I? 
each case the entire class participated. ud 

Procedure. In an attempt to distinguish between high anxious 
and low anxious Ss, the A-Scale with the biographical inventory 
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ee by Taylor (6) was given. The Otis Higher Examina- 
SE apr Abilities (5) was also given. Both tests were ad- 
E cd at the beginning of each course. The rest of the pro- 
their W E similar to that used by McKeachie et al., who modified 
E hodology slightly from group to group. The specific pro- 
bun 2t in our experiment was as follows: 'The examination 
ihe cis of randomly presented, multiple choice items and was 
«di S examination given in the course for each group. Each 
Dur ur divided at random into an experimental and a control 
ra Che experimental group in each class received answer 
: s with the instructions: “Put an X through the best answer 
‘or each item. Feel free to make any comments about the items 
in the space provided,” and the control group received answer 
Pie with the instructions: “Put an X through the best answer 
or each item. Do not mark in the space to the right of your 
answers,” 


RESULTS! 


In alll five classes, the experimental group made fewer errors 
han did the control group. 


er last half of the examination t t ; 
ev etween experimental and control groups 1n each class, how- 
er, failed to reach the 5% point of significance. An F test between 
classes was made and no significant difference obtained, so the 
findings for all the classes were combined and the one-tailed bino- 
ll expansion indicated that the present results could be ex- 
| oe by chance approximately three times out of one hundred. 
pen dingly, the null hypothesis that there was no difference In 
ee between experimental and control groups on the last 
of the examination was rejected. n 
iE he largest introductory psychology class permitted us gar 
i ee in terms of the A-Scale. The sixty-one Ss were pm 
hi € median and classified as low and high anxiety groups. the 
( gh anxiety Ss in the experimental grouP were significantly worse 
did- 0.04, one-tailed t test) on the first half of the examination, but 
1d not differ significantly from the low anxiety Ss on the second 
aif (See Table I). An evaluation, in terms of gain from the first 


1 
Data de it possible t 
gathered i : ith another study made it possible to test 
MeKo Nae come i othesis on a limited number of 


‘achie et al’s “anxi +n 

S usi al/s “anxiety reduction” hyp! odd 

serine another E. "t "anxiety" [the Palmer perspiration index de- 
ed by Mowrer, (4)]. This analysis yielded non-significant results, 
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Taste I.—Errors on First AND Last HALVES or EXAMINATION 


Group First Half Second Half 
Experimental 
High anxiety 9.58 7.25 
Low anxiety 7.24 7.35 
Control 
High anxiety 9.00 8.82 
Low anxiety 8.43 6.78 


half of the examination to the second, showed that the high anxious 
experimental Ss were significantly superior to the low anxious 
experimental Ss (p « 0.005). When a covariance to remove the 
effect of initial scores on gain scores was run the resulting F was 
still significant (p < 0.02). This difference in gain scores is iN- 
dicated in Table I by the sharp reduction in errors for the high 
anxiety experimental group on the second half of the test, and bY 
the slight increase in errors for the low anxiety experimental group: 
For the control group, the high anxiety Ss made more errors tha? 
the low anxiety Ss (although not significantly more) on both the 
first and second half of the test; while the high anxiety Ss did 
improve somewhat on the second half of the test the low anxiety 
Ss improved even more.? 

The correlation between total number of comments and numbC" 
of errors on the second half of the examination was —0.19 for the 
high anxiety experimental group, and —0.40 for the low anxious 
experimental group, neither of which was significant. 

The correlation between Otis scores and anxiety scores was 


—0.05. 
DISCUSSION 


The finding that all five experimental groups were superior to 
their corresponding controls corroborates the findings of Me- 


^In a personal communication from W. F. Smith, we have learned that 
he and F. C. Rockett, did a related study using the Sarason anxiety scale. 
They did not analyze their results to see if high anxiety Ss showed a greater 
gen than low anxiety Ss on the second half of the examination, but theY 
varied type of instruction and although the effect of type of instruction was 
not significant, they found a significant interaction between type of instruc- 


tion. and anxiety level. High anxiety Ss benefited more from anxiety Ter 
ducing instruction. 
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Keachie e ; 
to reach (ori pine of the individual intra-class comparisons 
dng Ss the meis p * evels of statistical significance may well be 
The sharp vi Nsin our classes than in MeKeachie et al. 
armi te tha ee op in errors for our high anxiety experimental 
indeed cond half of the examination is similar to the be- 
Tarore diste saghe et al.'s low need for achievement group and 
en anxiety reduction hypothesis. 
dane ee = al. report a correlation for their low need for 
Stithiantion > "ee of 0.73 between scores on the last half of the 
Mec and number of comments, and for their high need for 
Sane oak group of 0.05. Our correlations, between number of 
1e last half of the examination and number of comments, 


of course, r : 
and inet reverses the sign so that in both MeKeachie et al.’s Ss, 
s, those Ss who made the most comments performed the 
oached the magni- 


bes F 

hue pe of our correlations, however, appr 

low A-Seal Keachie ct al.’s low need achievement group, and our 

high rini group actually had a higher correlation than our 

nificant, R en a although neither of our correlations was sig- 

seems Mino Jer than speculate on these results at this time, it 

make mor simply to note that there 1s à trend for those Ss who 
e comments to make fewer errors on the last half of 


their i 
s examinations. 
ne i i 
practically zero correlation between Otis scores and anxiety 
hip between 


Scor ant f 

Bicis certainly does not point toward any relations 
Xiety and intelligence in our sample, although we cannot of 
hat there may exist dif- 


Cours ; a 
ia definitely rule out the possibility t 
ces in intellectual makeup between our high and low anxiety 


Ss whi 
€ are not shown by the Otis (1, ?). 
€ present results, taken together with those of MeKeachie 


e = 
ai indicate the applicability of “drive reduction learning the- 
o the classroom situation. 
I SUMMARY 
graduate Fort to confirm findings by M« 
ps classes were divided into experimental and control groups. 
an P R were allowed to comment on items in 
ice T: examination while the control groups Were not. When 
Performe "i of the five classes are combined, the experimental Ss 
9f the exa significantly better than the controls on the second half 
mination. The largest class with an N of sixty-one was 


cKeachie et al., five under- 
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divided into high and low anxious Ss on the basis of their A-Scale 
scores. Anxiety was found to be a significant variable in terms of 
improvement from the first half of the test to the second in the 
experimental group. It was also found that those Ss who made 
the most comments showed the greatest improvement, but this 


relationship was not significant. Implications of these findings are 
discussed. 
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SEARCHING ORIENTATION AND 
CONCEPT LEARNING’ 


GABRIEL M. DELLA-PIANA 
Public School District No. 111, Highwood, Illinois 


Results of research during the first three decades of the twenti- 
eth century established that learning may be enhanced by giving 
the learner “knowledge of results.” MeGeoch and Irion (8, pp. 
265-268), in summarizing much of this research, conclude that 
knowledge of results has its greatest learning value when obtained 
with optimum immediacy, specificity, and frequency. Theoretical 
interpretations of these conclusions (1, 2, 9, 11), suggest three 
Processes by which knowledge of results may influence learning: 
(a) showing progress and thus “motivating” the learner. (b) pre- 
senting a standard and thus “guiding” the learner's trial responses. 
(c) indicating errors and allowing the learner to find out why 
the response was wrong, thereby “activating a searching orienta- 


tion” in the learner. , 

The present study assumes that a “searching orientation” may 
be induced by manipulating the method by which the subject ob- 
tains “informative feedback.” On the basis of this eee 
Predictions are made concerning how variations in eat 
Searching behavior may influence the learning of a at E 
formative feedback procedures" is a brief term used in Lai 
In lieu of the longer phrase “procedures for giving à lene? tation” 
edge of Which responses are incorrect." “Searching orienta on 
refers to the subject’s hypothesizing as to why a response was no 


j d 
Correct, loped by the subject are teste 
i ee ee of results. Presumably, 


Y him each ti ives ki 
4 ime he receives Kno 
this process leads to the learning, not only of correct sepone 
ut also of reasons underlying the correct responses. Thus, the 
3 and procedures 
A more tation of the hypotheses, data, ar 
of this study is 1 Ue omn M. Della-Piana's "Two ics ape pn 
ack Procedures: A Comparison of their Effects on the Sage o; * wi 
TPPIS" unpublished Ph.D. dissertation, 1956, un tio EN 16386, Uni 
ois Li E icrofilm, Publication No. 16,386, Uni- 
ibrary. Also available as PP ^ Arbor, Michigan, $1.10. The 


Versity Microfil i St 
ms, 313 No. First 5t. 
Study was done under the direction of Professors Lee J. Cronbach and J. T. 


astings, 
245 
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searching orientation was presumed to act as an intervening vari- 
able through which knowledge of results might influence learning. 
Underwood (11, pp. 414-417) mentions this "searching orienta- 
tion” (he calls it “attempt to discover”) as a variable probably 
accounting for the learning value of knowledge of results. Jones 
(6) and McConnell (7) conducted studies of classroom teaching 
procedures in which a "searching orientation" scemed to be in- 
duced by feedback procedures and appeared to facilitate *mean- 
ingful” learning. 


METHOD 


Two experimental treatments. Two informative feedback pro- 
cedures were developed to produce variations in "searching orien- 
tation” of subjects. These procedures constituted our two experi- 
mental treatments: one (dependency treatment) in which a subject 
was immediately told the correct answer when he gave an incorrect 
response, and the other (scarching treatment) in which a subject 
was told to keep trying until he “discovered” the correct answer. 
Our searching treatment is actually the standard “correction pro- 
cedure” used in experimental learning studies (10, p. 13). In the 
present study it is called the "searching treatment” because it W25 
assumed to induce the subject to develop hypotheses as to why 
his responses were wrong. The dependency treatment was devel- 
oped to produce a lesser degree of searching behavior than that 
produced by the searching treatment. We attempted to attain this 
aim by varying the feedback procedure so as to force the subject 
to make a response antagonistic to searching. The response we 
attempted to induce by the dependency treatment was the rela- 
tively non-inquiring one of “looking to the experimenter for the 
answers.” Differences between treatments, other than the theoreti- 
cally crucial difference, were minimized. For example, “time 0? 
item” differences were minimized; however, the degree of “search- 
ing” engaged in during that time was varied. We did not directly 
investigate the psychological process of “searching.” Instead, We 
investigated hypothesized differential effects which variations 1n 

degree of searching" presumably had on the learning of concepts 

Two ability levels. An attempt was made to utilize a two-by- 
two treatments by levels design. Guilford's Figure Exclusion Test 

(3) was used as a basis for dividing subjects into ability levels 
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S is 
-a nawe factors related to those called for by our 
forty sae the eighty-two students tested, we selected the 
Cn econ " o the Guilford test (twenty in each treatment). 
groups on th m most half of the differences between ability level 
righ cer 7 Figure Exclusion Test were lower than two times 
wey Gas A is of the difference between scores. Thus, we had 
un Mu te denee in the differences between ability level groups 
Ç, -— 
Bee significant differences between ability level 
able nete obtnaesl in this study on any of the dependent vari- 
cr — We may say that differences between levels, if they 
ike mi 5 ere not strong enough to show up in our sample under 
i ditions noted above. "Therefore, further data on “Jevels” 
not be reported in this paper. 
Nes concept learning task. An ind 
isos o by Heidbreder and Ivy (4) 
"rote d two reasons: First, it may b 
association” (naming the concepts 


fs or define their distinguishing chara 
define cem ul association" in which the learner can name and 
of the t ug Sounaps (4, P- 124). This was an essential characteristic 
WR ask used in this study since we expected our two feedback 
ris edures to differ with respect to the extent to which they facili- 
te ed one or another of these two kinds of learning. Secondly, the 
ask is similar in many respects to certain school learning tasks 
Where drill procedures similar to our experimental feedback pro- 
cedures are utilized; e.g. in the learning of basic arithmetic facts 


or sight vocabulary. 
coe: in this study, “concepts” 
parti es of a group of geome 
em cular concepts we utilize 

of shape (eg., *diamond-sh "), and two of number 
(e.g., “fourness”). Each of the six conce 


Syllable name, such as BLAG or LORB. 
The concepts were presented to the subject embedded in draw- 


ics on 3” by 5” cards. The subject was presented the drawings in 
s — series of six drawings each. Each of the six drawings in 
Kete MM one of the six concepts. Each concept was repre- 

in each series; however, in addition to the constant com- 


uctive concept formation task 
was used. We chose this 
e learned either through 
without being able to 
eteristics) or through 


refer to certain common at- 
ted on eards. The 
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ponent of color, shape, or number, which defined the concept, it 
had a variable component from series to series. Thus, the concept 
“fourness” was always represented by a drawing of four geometric 
designs. However, in one series it was “four, pink, triangular de- 
signs"; in another series it was “four, orange, rectangular designs"; 
and so on; but it was always four designs. 

The subject was asked to learn the nonsense syllable names of 
designs presented to him singly and successively in series after 
series of cards. The conditions made it possible for the subject 
to discover that a given nonsense syllable, though used as the 
name of very different designs in the different series, was always 
applied to designs which, taken together, possessed a common and 
distinguishing characteristic. Thus, “BLAG” was always used as 
the name of a design of the concept “fourness” (the design having 


four objects) though the shape and color of the design varied from 
series to series. 


DATA AND SCORES 


The hypotheses of the study will have more meaning to the 
reader after a brief description of the data and scores has bee? 
digested. There are three sets of scores: learning series scores 
post-treatment scores, and scores combining learning series 2n 
post-treatment measures. 

Learning series scores. In the learning series a concept was said 
to be "attained" when a subject named it correctly on the first 
trial (and all subsequent trials) of at least two consecutive series 
and when he never named it incorrectly after these two consecutive 
series. Thus, “concept attainment” refers to ability to give non- 
sense syllable names of concepts without regard to whether one 
knows the distinguishing feature of the concept. 

The raw data of the learning series consists of an error tally 0n 
a concept for each trial (within a series) in which the subject 
names the concept incorrectly. From these data we obtained a sum 
of series required for concept attainment and a sum of number 
of concepts attained. 


Two means, derived from these data, were used in testing the 
hypotheses of this study. 


(a) Mean number of concepts attained (NCA). This was ob- 
tained by summing the total number of concepts attained by each 
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person in a treatment and dividing this sum by the total number 
of persons in that treatment. 

"Es number of series to attain concept (SCA). This was 
"nine z ipii the total number of series required for at- 
dividing this Ae concept by each person in a treatment and 

Eo sum by the total number of persons in that treatment. 
habs reatment scores. Following the learning series, each sub- 
then ee given two tests: first, an open-ended definitions test and 
bts ee die definitions test. These tests presented the 
3 4 c with a list of the nonsense syllable names of the concepts 
rea Mises the subject first to write a definition of the dis- 
E z ing feature of the concept and then to select, from among 
lee es alternatives, the distinguishing feature of the concept. The 

score was the number of concepts correctly defined. On the 
HF apincore test this was called a “recall of meaning score" and on 
oe choice test it was called a “recognition of meaning” 
h Two means, derived from these scores, 
Ypotheses of this study. 
ms Mean recall of meaning (Re 
Him ite the total number of meanings reca 
i atment and dividing this sum by the to 
n that treatment. : 

(b) Mean recognition of meaning (RgM). This was obtained by 
Summing the total number of meanings recognized by each person 
a a treatment and dividing this sum by the total number of per- 

ons in that treatment. ‘ 
hus, the post-treatment measures allowed us to determine ai 
mean number of concept definitions recalled and recognized by 
each treatment group. 
fag ed learning series and post-treatment score. One mean 
Obtainei i measures. 
d by use of combined (MCA). This was 


ob a) Mean meaningful-concept attainment : 
tained by summing the percentage (decimal fraction) of at- 


tained concepts for which each person in a treatment could recall 
€ correct meaning and dividing this sum by the total number of 


Persons in that treatment. 
allow combining learning series an 
ce ed us to determine the mean percentage of 
Pts for which a treatment group could recall the 


were used in testing the 


M). This was obtained by 
lled by each person in 
tal number of persons 


d post-treatment measures 
“attained” con- 
definitions. 
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TABLE I.—Summary or TESTS or SIGNIFICANCE OF DIFFERENCES 
BETWEEN TREATMENT MEANS ON Four VARIABLES 
(d.f. = 1 and 36) (Treatment N = 20) 


Dependent Variable Measures 

Mean of |Mean of concept|Mean of concept) Mean fraction 

concepts definitions definitions | ofnamed çon 

named recalled recognized | cepts recalle 
Searching treatment 4.40 4.75 5.10 .97 
Dependency treatment 3.75 3.60 4.05 „81 
Difference .65 1.15 1.05 -16 
Mean square* 4,22 13.22 11.03 M 
F ratiof 1.84 8.58} 9.59} 12ł 


* Between treatments. 
f Using within treatments mean square. 
1 Significant beyond 0.01 level. 


TESTS OF HYPOTHESES 


Results of the major hypotheses of the study are presented n 
Table I. Although the study was set up with a two-treatments-bY* 
two-levels design no data on levels or treatment-levels interaction? 
are presented here because of non-significance of “levels” diffe" 
ences on both ability and dependent variable measures. 

Hypothesis I. The mean number of concepts attained (NCA) 
will be greater for subjects in the “searching” treatment than ! 
will be for subjects in the "dependency" treatment. egal 

As Table I shows, treatment differences were in the predicte 
direction, but were not statistically significant. Incidentally, they 
were significiant for low ability subjects taken alone. 

Hypothesis II. The mean number of series required for €9?; 
cept attainment (SCA) will be less for subjects in the “searching 
treatment than for subjects in the “dependency” treatment. 5 

A sign test for the significance of the direction of difference 
showed that only seven out of twelve differences between tre", 
ments were in the predicted direction. This gives us à z-score 
of only 0.058, definitely not statistically significant. 

Hypothesis III. The mean number of concepts for which on? 
can recall the meaning (ReM) will be greater for “searching 
treatment subjects than for “dependency” treatment subjects. 

Hypothesis IV. The mean number of concepts for which one 


? (raw score — mean)/standard deviation. 
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can recogniz : 
=. a s (RgM) will be greater for “searching” 
Hypothesis - Eye for “dependency” treatment subjects. 
‘ious pensentes ; : all the concepts attained by subjects, the 
recall the ais : tre concepts for which the subjects can 
treatment subi = (definition) will be greater for “searching” 
ike "Table 1 jects than for “dependency” treatment subjects. 
le it es boite concerning Hypotheses III, IV, and V, 
significant i he overall treatment differenees are statistically 
in the hypothesized direction. 


Gus DISCUSSION 
filie White ke of the above results may be summed up as 
it is shown a the restrictions of the procedures of this study, 
answer (S- hat subjects who are allowed to search for a correct 
answer dh compare with subjects who are told the correct 
(a) N -group) in the following Ways: 
correct] O significant differences in numb 
(b) na " 
tions eet differences in number of series presenta- 
(c) S. ame to name concepts correctly. 
definition: roup learns to recall and recognize 
(d) ^ les concepts than D-group. » 
ofa ab oup (more than D-group) learns to recall definitions 
There er percentage of concepts they learned to name correctly. 
study are certain qualifications and suggestions for further 
which may be noted at this point. 
onsequence of the 


very i j 
Positive s important suggestion arises as & C € 
e results of Hypotheses I, IV, and V. The rationale for 


t 
Poe d YPotheses was that the “searching” treatment, more than 
i inan a treatment, induced a set for “formulating and 
ivity ths ea and that this hypothesizing and testing ac- 
i mea as the main factor in accounting for treatment differences 
ave ee of learning. Now that significant differences 
ines cen found, it is advisable to investigate further along two 
. Suggested below. 


Probi an attempt should be made to get at the differences in 
pjects in each of the treatments. 


ade to get information on dif- 
between treatments. That is, now 


at w 
€ have found genuine differences between treatments on 


er of concepts named 


significantly more 
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our dependent variable measures, we want to know more ues 
cally how these treatments influence learning. Thus, the T 
remains open and testable: To what extent, are observed E 
ences between treatments due to “hypothesizing and d 
tivity” and to what extent are they due to “differences in moti 
tional characteristics"? . aw 
Further research on Hypotheses I and II is also suggeste 4 
two observations. First, an association interference variable ie 
not equated between treatments and operated against the pe 
dietion. Only one incorrect association could be made by sdi 
pendeney treatment subjects before they were given miae 
name of a concept card. It was possible for "searching we v 
ment subjects to make as many as five incorrect pup 
with a given card. For experimental purposes, it would be ue 
Sible and desirable to equate this variable between treatmen / 
Secondly, limiting ourselves to only ten series of cards serious y 
restricted the range of concept attainment scores for our -— 
of subjects. What effect more trials, and greater discrimination 
would have on our results is a matter for further experimentation: 


SUMMARY 
Two informative feedh 


02 
ack procedures were developed to P! 
duce hypothesized variati 


ons in searching orientation of subjects 
Searching orientation was not directly observed; however, Led 
dictions were made concerning how variations in searching orie ' 
tation would effect, performance in naming and defining conei o 
As predicted, the group assumed to do more searching learne aie 
give significantly more definitions of concepts. This major pr y 
tive result suggests the value of more direct study of the the 
pothesized psychological processes presumably induced by up 
feedback procedures. The hypotheses that the searching £O 


: : sup” 
would learn to name more concepts in fewer trials were not 
ported. 
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I s l 
n an earlier study (1), it was suggested that “ratings appear to 


T Pri predictors of that aspect of maladjustment which had 
deseri M society’s reaction to acted-out behavior, and self- 
bimer Ive data appear to be better predictors of what the person 
of this e chose to do.” This report examines the implications 
for thre n othesis by following the subsequent course of events 
Previous x and thirty-one children who participated e i 
measures. udy and provides validity data for the predictor 
ten rate criterion was needed for 
itiated b the hypothesis, i.e., one for eve 
in givi Y a representative of a soc t 
Brenta t an award or the police in making an arre ar 
suc] ìn which the individual person initiates action voluntarily, 

1 as in deciding to obtain mental health assistance or to with- 


Taw from school. 
stated particular criterion measures available for studying the 
Stude: hypothesis were: (1) designation by the school as an honor 
Brad nt in the mid-eleventh grade, and (2) withdrawal before 
uation from the twelfth grade. . 
4 of this report is based 


n Population on which the first par 
rin €s all the students in eleven ninth-grade classes of three 
eee County (Maryland) junior high schools which 

1950. This report also in- 


parti A 
l 
f the tests given to fifty- 


the examination of each 


nts involving actions in- 
h as the school, 


ing] 


ci A ; 
ciu S rapa in the previous study 1n 
an item analysis based on one 0 
k 
y Ü 
Study ae Psychologist, Prince Georges County (Md.) Mental Health 
enter, a demonstration activity of the U. S. Public, Health Service. 
tayri 
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one ninth-grade boys described by teachers as “seriously mal- 
adjusted” (out of a population of four hundred and four boys 1n 
the nine junior high schols included in the previous study). 


I 


The predictor variables for which scores were obtained at the 
time pupils were in the ninth grade were: 
(1) Forced Choice Test—devised to obtain teacher ratings 0? 
pupils 
(2) California Test of Personality—"Self" score 
(3) California Test of Personality— "Social" score 
(4) SRA Youth Inventory—Basic Difficulty score 
(5) Sociometrie rating 
'The criterion variables in this study are: 
(1) Appearance of name on high school honor roll in February: 
1953, that is, at mid-eleventh grade 
(2) Withdrawal from school prior to graduation from twelfth 
grade, that is, prior to June, 1954. : 
The use of honor roll as a criterion had certain inherent limita 
tions. A small number of children did not have an opportun! d 
to be considered for honor roll status either because of entrY a 
a high school which did not make such awards or because they b a 
moved from the community under conditions which were pi 
sumed to indicate continuance in school but which did not an 
information on honor-status to be obtained readily. Design®™ 
as an honor student is, furthermore, subject to the operation e 
numerous social influences, difficult to define and assess, 2D The 
sibly worthy of serious sociological study in their own right. ^7 
correlations reported below, therefore, may be somewhat & 
uated because of the inaccuracy of the criterion. «were 
The data on withdrawal from school prior to graduation, and 
more satisfactory. Of the original group of three hundre cho 1 
thirty-one students, information on in-school or out-of-s eig? 
Status was obtained on all but fifteen children (seven boys, ent? 
girls), or 4.5 per cent of the three hundred and thirty-one § M two 
(one hundred and sixty-nine boys, one hundred and sixty 
girls) in the population which was followed. a11 tbe 
Comparison of scores made by each criterion group on the 
five predictor variables is shown in Table I. The a “ig d- 
honor roll group and the means of the graduates are nearer t 
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TABLE I.—Scores or Honor vs. Non-Honor AND GRADUATE VS. 
Non-Grapvate CHILDREN 


a b c djé H g h $]3]|]1 
H Not on Grad- | Non- s 
EST tono Diff | t |r dd grad. Diff | t | r* | rt | rt 
N 39 | 277 255 | 61 

Score | Score Score | Score 
Forced choice 25.03/119.82| 5.21/6.9).45 21.42/16.46) 4.96)6.5).46).46).27 
Sociometric 35.08/51.49/16.41/5.9).45/46. 89/60. 23/13. 34/4. 9|. 39|. 42|. 22 
California self 73.9568.92| 5.03/3.0.22/70.54/65.38| 5. 16/2. 7|. 25|. 24|. 10 
California social |75.92/70.90! 5.02,3.2/.24/72.4907.48| 5.01|2. 9|. 26|. 25|. 20 
SRA basic diffi-[12.00/14.96| 2.96/1.8/.13/14.15]16.48| 2.383/1. 1/.11/-12/. 14 


culty 


* Biserial r, 
T Triserial r. 
1 Point-biserial r for sex. 


justed” pole of each of the variables than are the means for the 
non-honor roll group and non-graduates. (Signs of correlation 
coefficients are accordingly reflected wherever appropriate through- 
Out this paper, i.e., all coefficients are shown as positive.) For 
each variable except the SRA Basic Difficulty, the mr 
between means exceed the one per cent level of confidence, ue 
15, exceed a t-value of 2.58, and may therefore be regarded as 


Statistically stable. 

The relationships between each p 
Criteria are also shown in the form o 
In Table I, columns e and j. There app 


between à. E a it 
: the two criteria in the magnitu P : 
tion Coefficients obtained for each of the five predictor variables. 


€ largest difference between the pairs of coefficients for any 
Variable is 0.06 on the sociometric predictor, and this difference 
1S smaller than the standard error of each coefficient. The dif- 
erence m e due to chance. — ; 

ince the dn d pierdo coefficients is so nearly the 
Same for the two criteria, it is possible to consider the criteria 
as being perhaps different aspects of a single or general underlying 
Criterion Which may be described as *tamenability to school ps 
Tather than an indicator of the two different categories of e rents 
Postulateq at the outset of this paper. Accordingly each predictor 


redictor variable and both 
f coefficients of correlation 
ears to be little difference 
de of the biserial correla- 
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was examined against the combined criteria represented by three 
segments of the distribution, namely, the thirty-nine honor stu- 
dents, two hundred and sixteen children who were not on the honor 
roll but did complete high school, and the sixty-one children who 
left school prior to the completion of the twelfth grade. The tri- 
serial coefficients of correlation so obtained are shown in column 
k, Table I. 

On the basis of the data reported in Table I, it appears that 
ratings made by teachers and ratings made by peers are better 
predictors of both honor roll attainment and of withdrawal from 
school than are the scores obtained on the standard commercia 
self-descriptive personality tests employed in the original study. 

A review of the data, however, shows that important sex dil- 
ferences run through both the predictor and the criterion measures. 
When each of the triserial coefficients between the predictor an 
the general criterion is compared with the corresponding point- 
biserial coefficient for sex (obtained from the previous study 2? 
shown in column 1, Table I,) it appears that ratings have a large" 
correlation with sex than do the self-descriptions. It was also 
observed in the previous study (ibid., p. 40) that, “On the basis 
of correlational data, rating devices appear to have more in gor 
mon with, or to be more valid measures of, the adjustment of boy 
as a group than of girls as a group; self-descriptive persona ^ 
tests appear to have more in common with, or to be more van 
measures of, the adjustment of girls as a group than of boys 35 
group." re 

Furthermore, both criteria employed in the present study ae 
characterized by a pronounced sex differential. As shown 1D n: h 
II, girls tended more often than boys to be recognized as M 
achievers: 18.8 per cent of the girls attained honor roll, wier 
only 6.2 per cent of the boys did so. More girls (88.3 per werd 
remained in school until graduation than did boys (73.4 pe” r^ 
The differences between both pairs of percentages are stable, nt 
shown in Table II by a t-ratio of 3.4 for each. The phi-cocfiot 
between sex and honor roll is 0.19 (with a potential maxim i 
of 0.38) ; the phi-coefficient between sex and failure to gradu? 
is also 0.19 (with a potential maximum of 0.49). p se* 

Accordingly, the data of Table I were recomputed for each he 
separately and the results are shown in Tables III and IV. tio? 
correlations shown in Table IV show that the predictors func 
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TABLE IL—Srx DIFFERENCES IN ACHIEVEMENT 


Number Per Cent 
H Not on Non- H Not on Grade Non- 
ral | honor /Graduate] grace | "roll | Some mte | uk 
um 162 | 10 | 152 | 119 | 43 | 6.2] 93.8 | 73.4 | 20.6 
irls 154 29 125 136 18 18.8 | 81.2 | 88.3 | 11.7 
e 
Total 316 39 277 255 6l 
Sex differences 12.6 14.9 
t = 3.4 t= 3.4 


* 
Honor Roll of February, 1953 


In. different ways for the two sexes. Ratings for the sexes taken 
Separately remain generally superior to self-descriptions in pre- 
dicting the general criterion “amenability to school goals," but 
the raters? superiority is greater for the boys, whereas for girls it 
5 only slightly better than the self-descriptions. 

Teachers’ ratings predict the general criterion better than do 
ratings by peers, and they predict the small proportion of honor 
Toll boys best. The peers predicted withdrawal of girls relatively 
nefliciently, but they predicted honor roll standing about as well 
4S teachers did. To some extent the lesser efficiency in prediction 


Taste IIT. Mgaw Scores or Honor Rout, Non-Honor Rou 


Grapuares, AND NoN-GRADUATE CHILDREN 
B Girls 
joys 
Non- | Non- 2 
Non- | Non- Z [Honor | honor [read | Total) & 
Honor | honor | Goq-| Total] = oll | grad- eae | ^0 | 3 
roll | grad- ome in 3 S uate | "ate e 
uate 
N i amd ay | am 29 | 107 | 18 | 154 


Forced choi 18.44]21.95| 4.96 
A ce z 04] 5.9524. 69/21 . 80 j i 
al e Metric Fa bg big Ep 19.5936. 03/46.2352.8345.0818. 13 

fornia self  |71.20/59.3550. 42,08. 69] 11.22/74.90|70.51062.89 70.4512. 41 
°rnia social [73:1059, 36/67. 30/69.04/11-32/76-90|74.4267.89/74.12110.06 
,Uasie diff.| 9.99/12:30 13.79 12.59|11.70 12. 7216. 75/22.89 16. 7112.31 
y 
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TABLE IV.—ConnELATIONS BETWEEN CRITERIA AND PREDICTORS 
BY SEX | 
Criteria 
Predictors Boys Girls 
Honor Non- Com- Honor Non- Com- 
roli* | graduate*| positet roll* | graduate | positef 
Forced choice 59 46 50 38 42 40 
Sociometric 55 40 44 39 25 31 
California self 11 16 16 25 36 30 
California social 18 12 16 17 36 27 
SRA basic difficulty 12 08 09 23 30 28 


* Biserial r. 
T Triserial r. 


of withdrawing children by peers when compared with teachers’ 
ratings of these children may be due to the fact that in the 8000” 
metric device the focus was predominantly upon selecting the bet- 
ter adjusted children. . 

The ratings account for almost all of the predicted variance m 
the two sub-criteria for boys. For girls, the ratings account for s | 
relatively smaller (but still the major) part of the predicted jos 
iance and self-descriptive scores also contribute appreciably. Th 
self-deseriptive scores contribute particularly to the prediction ° 
girls’ withdrawal. * 

If we employ a correlation coefficient of 0.25 between Californ? 
Self score and the Forced Choice score for girls, as derived m t 
previous study (ibid.), these two predictors then jointly yiel 
multiple R of 0.49 with girls’ withdrawal, in comparison with b 
coefficient of 0.42 from teachers’ Forced Choice ratings alone; 6 
corresponding R for boys of 0.465 exceeds the coefficient of D 2 
(obtained from teachers! Foreed Choice ratings) by an insigni 
cant amount. It is evident, therefore, that the self-descriptive ue J 
employed in this study added mainly to the prediction of W1 
drawal in the case of girls only. ve 

From these data it appears that the conclusion quoted «P 
from the previous study on the differential susceptibility of n 
two sexes to the two general categories of measurement is the ar 
riding finding. Within this general finding, there is reason © o 
lieve that self-description will yield useful predictions of sc 


ll 


e 
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withdrawal of girls. While these self-descriptive scores do not 
equal the teachers’ Forced Choice scores in predictive efficiency 
for girls, they do contribute a significant increment to this predic- 
tive efficiency and they do not do so for boys. 

The results of these computations are consistent with the con- 
clusion of the previous study with respect to the greater validity 
of personality tests for predicting girls’ adjustment than for pre- 
dicting boys’ adjustment. Ratings are generally more efficient than 
self-descriptions for predicting both aspects of maladjustment as 
defined in this study, with one important exception: ratings by 
beers are relatively ineffective in predicting withdrawal from 
School by girls. Personality tests are relatively useful supplements 
to ratings by teachers for predicting the withdrawal of girls. This 
refers only to ratings by teachers when obtained by means of a 
Forced Choice Test. As shown in the previous study, categorical 
ratings by teachers do not appear to differentiate significantly 
among girls. 

The hypothesis stated at the outset of this paper is sustained in 
the following respects: Ratings are better predictors of honor roll 
Status than of withdrawal for boys only; self-descriptive scores are 
z predictors of withdrawal than of honor roll status for girls 
only. 


II 
Alth E iptive test scores had the lowest relation- 
uber i endo hypothesized that study 


Ship to manif, l adjustment, it was à 
9f the large Media it ae employed in such instruments might 
Yield some insight into the nature and causes of children’s failure 
m adjustment, Accordingly, an item analysis of the SRA Youth 
“Dventory was undertaken for a population of fifty-one boye who 
in the ninth grade had been rated as “seriously maladjusted” by 
their teachers, Of these fifty-one boys; twenty-five had dropped 
ut of school before graduation, twenty-five completed the twelfth 
Stade, and on died. 
An initial dieting of the Basic Difficulty scale of the Youth In- 
Ventory selected those items on which both sub-groups of twenty- 
ive boys differed from the normative male group, that is, jointly 
either surpassed or fell below the normative males. Table V shows 
the items on which percentages of boys in the group designated as 
maladjusted by teachers differed from percentages of boys in the 
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TaAnLE V.—lrEMs on Wuicnt “Matapsustep” Boys DIFFER From 
THE Normative Boys 


Malad- | Difer- | — t 


Normal |justeq* | ences 


A. Items to Which the Normative Boys Respond More Often than 
the “Maladjusted” Boys 


94 | I must always be ‘‘on the go" 30 18 12 | 2.14 
125 | I’m ill-at-ease at social affairs 26 | 10 | 16 | 3.64 
B. Items to Which the “Maladjusted” Boys Respond More Often Than 

the Normative Boys 


29 | My teachers don’t understand me 11 26 15 2.38 

32 | My teachers don't like me 5 16 11 2.12 

72 | I have trouble keeping my temper 27 44 17 2.39 

76 | I can't help daydreaming 29 46 17 | 2-36 

84 | I can’t see much future for myself 11 30 19 |2.92 

106 | I can't help feeling bad when I can't get! 6 26 20 3.17 

my own way 

110 | I'm always thinking up alibis 15 28 13 2.03 , 
113 | I can't do anything right 2 16 14 | 2-69 | 
158 | I feel that I'm a burden on my parents] 10 22 12 | 2-08 

198 | My parents neglect me 2 12 10 2.17 
269 | I feel that I'm not living up to my re-| 22 38 16 2.2 


ligion 


normative group by a t-ratio of 1.96 or more, significant at ihe 
0.05 level. Thirteen of the one hundred one items met this standar i 
On the basis of data presented in Table V, A, it appears tha 
the place of two items (94) and (125) in the Basie Difficulty Soale 
is open to question. Apparently more boys in the normative adoles, 
cent group described themselves as “I must always be ‘on the 8° 
and "ill-at-ease at social affairs” than did maladjusted boys. Pos 
sibly being “on the go” is an acceptable or even desirable part E 
the adolescent stereotype; and it may be acceptable for an adole?" 
cent boy to attend a social affair and feel ill-at-ease, whereas ma 
adjusted boys may avoid awareness of this experience by not ? i 
tending at all. 
The remaining items (Table V, B) which set maladjusted boy i 
most sharply apart from the normative group appear to deal wit? 
difficulties in establishing control of impulses (items 72, 76, 106) 
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and with a sense of failure to attain various social or school 
standards. 

A second independent sorting selected items on the basis of dif- 
ferences between the two sub-groups of maladjusted boys. Because 
of the small size of these two sub-groups, it was not expected that 
differences in percentages responding to the items when so classified 
would be very reliable. Those items on which difference between 
the two groups showed a t-difference of 1.50 or more, indicating 
that differences of the observed magnitude could be expected to 
occur by chance in thirteen of one hundred instances, are shown 
1n Table VI. Of the one hundred one items, fourteen satisfied this 
Standard. The differences observed, therefore, may have arisen by 
chance and the discussion below is only suggestive of hypotheses 
to be tested with larger populations. Nevertheless, in view of the 
fact that the patterns of response appear to have coherence and 
the fact that the data represent a follow-up involving a three-year 
Span, the items with largest t-scores for the differences between 


Tasty VI.—Irems SHOWING DIFFERENCES BETWEEN Twenty-Five 
MALADJUSTED GRADUATING AND TwENTY-FiVE MALADJUSTED 
WivupRAwiNG Boys 


Male rith- | t-Ratio 
- EXFL (GW 
Per ceni|Per cent Per. cent 
1.76 
6 | I wish I could be more calm when Ire-| 56 51 | 32 
cite in class 
T I hate school : 7 e^ "em 
es I wish I could quit school now 32 21 12 1.75 
za My teachers play favorites 32 14 8 2.22 
fe I'm easily excited " 15 20 1.80 
i 7 | I lack the drive others have e 4 |17 8 |1.58 
103 | Ir afraid of failure or humiliation to | 36 | 16 | 1.97 
a I wish I were more popular 44 | 24 | 24 1.53 
in I need to learn to be a good listener 0 7 16 - 
61 | I feel there's a barrier between me an 
my parents š 
162 | I can’t discuss personal things with my EIU ae ee 
parents 
E My parents play favorites : 1 ki e 
es 1 feel like leaving home ; E 
A Dthink.abouteex a good deal of the time| 24 17 8 |1.58 
6 | I wonder about the after-life 95 qo 8 [1.90 


266 The Journal of Educational Psychology 


percentages are reported in Table VI for such use as they may 
have in formulating hypotheses for future study. 

From Table VI it appears that the maladjusted boys who re- 
mained in school report that they feel impulse-ridden (items 7i, 
230); that they are anxious about failing to measure up (items 
103, 296) ; and they express a need to make a change in self (item 
132). These boys feel oriented toward adults: they notably do not 
suffer from feelings of inability to discuss personal things with 
parents (item 162) and they are highly concerned about the sup- 
posed partiality of their teachers (item 28). The maladjusted boys 
who withdrew from school felt that their parents played favorites 
(item 168) ; that there was a barrier between them and their par- 
ents (item 162). They were not concerned about teachers playing 
favorites (item 28). They felt that they lacked the drive others had 
(item 87), did not feel disturbed when required to recite in class 
(item 6), and did not aspire to greater popularity (item 119). They 
did not express fear of failure or humiliation (item 103) and could 
be described at the time of taking the test in the ninth grade as 9 


defeated group who openly sought to get away from both home 
(item 188) and school (items 11 and 13). 


SUMMARY 


This paper has indicated that ninth grade teachers’ Foreed 
Choice Test ratings of children’s adjustment are an indicator 9 
children's amenability to school goals and are generally superior 
for this purpose to self-descriptive personality tests. Sex differ- 
ences appear which are significant for practical prediction (i.e. 
selection) purposes. The pattern of correlations between the tw 
categories of predictor (ratings and tests) and the two sub-cate- 
gories of the general criterion (honor roll and withdrawal) is spe- 
cific for each sex. Self-descriptive personality test scores appeat 
to be relatively useful supplements to Forced Choice Test rating? 
by teachers for predicting withdrawal from school by girls. 

Tt has also been suggested in this paper that the maladjusted 
category included both (1) those children who had difficulty 1” 
control of impulses and who were concerned about attaining 8p" 
proval through school performance, and also (2) those children 
who had given up the hope of ever obtaining approval for schoo 
effort. It has not been shown in this or in the previous study that 
these two types of children could not be differentiated by teachers: 
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The data have shown that it may be possible to key items on self- 
descriptive personality tests to distinguish between varieties of 
maladjustment which have different functional significance for 
public education and public health programs. In particular it may 
be possible to discover, either through carefully formulated ques- 
tions to teachers or through experimentally derived keys for self- 
descriptive tests, which children would be readily amenable to the 
simple provision of therapeutic help and which children may re- 
quire more aggressive casework techniques. 
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DIFFERENTIAL SCORES AND UNSTABLE 
PERSONALITY 


ROBERT C. SCARF 
Ball State Teachers College, Muncie, Indiana 


The widely used psychological examination for college íresh- 
men, published by the American Psychological Association, 18 the 
basis of this report. The examination has two parts, a quantitative 
part and a linguistic part. Each freshman therefore has two scores; 
a Q-score and an L-score. The difference between the two is called 
an “unevenness” score. Most freshmen make scores that are fairly 
close, but a few show a big difference, being average in one score 
but close to the top, or the bottom, in the other while a rare case 
may be near the top in one, and near the bottom in the other. The 
median differential score, or unevenness score, is 16 percentile 
points, for one thousand nine hundred freshmen at Ball State 
Teachers College in 1948 and 1949. 

Two groups of freshmen were compared. In one group the scores 
were even, or nearly so, the difference was less than six percentile 
points, In the other the difference was great, at least forty per 
centile points. In making up these two groups, certain records were 
excluded—freshmen who were over the age of twenty-one; who 
had attended another college for more than one year; who Te 
mained in school for less than two terms; and whose total score 0? 
the ACE test was ranked below the thirtieth percentile. This 
thirtieth percentile is regarded as a safe boundary. Those above it 
could surely get at least passing grades. 

The data for men and for women were treated separately, and 
the data for freshmen who ranked higher in Q-score were treate 
Separately from the data for those who ranked higher in L-score- 
No significant differences appeared; therefore, all data are 7 
cluded in one comparison, 
roe uneven group was found to do worse on the basis of grade 

y were on probation, or disqualified for poor academic Wor 


more often. T ey earned more “F's”, and more “D’s.” These facts 
are shown in Table I. 


The uneven grou 
group; forty-nine 
an insignificant di 


P finally graduated almost as many as the eve? 
Der cent as compared with fifty-three per cent, 
ference, However, the uneven group took more 
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TABLE L—A 
—Acapvemic Recorp or Two Grours or COLLEGE STUDENTS 


Differential Score Per Cent Per Ci 
ee d N ier Gent on | Per Cont [pastus "b" Earning, "F" 
Below 6 113 10.6 0.9 37.4 11.4 
25.0 7.2 56.1 25.0 


Over 40 180 5 
. j 


All critical ratios are greater than 3. 


TABL = 
E IL—PznsisrENCE IN COLLEGE or Two GROUPS or STUDENTS 


: Per Cent S; i 
Average Terms | Excessive aod 


Differential Score N Per Cent 
Graduating in Attendance to Graduate 
—— 
Below 6 113 53 9.5 6.2 
ove 40 180 49 tS à 


more than the usual twelve terms, 
spent more than twelve terms 
t more time in 


e to graduate. 30.695 spent 
nen the even group, only 6.2% 
school Omen a degree. The even students spen 
Tabl » 9.5 terms, compared with 7.6 terms for the uneven group. 

n € II shows the data. 
ee comparison is possible. One comparison, shown in 
and thi , is based on the number of students, ie., of one hundred 
d irteen students in the even group, only one was disqualified 
oe scholarship, while thirteen out of one hundred and eighty 
h ents in the uneven group were disqualified. This does not tell 
€ story, as the time spent in school is also a factor. That is, a 
lification has 


s H 
eani Who spends six terms in school before disqua: 
etter record, in a way, than one who is disqualified after three 


t $ i 
xia This comparison, with terms in attendance as the basis, was 
€ and the results are shown in Table III. 


or Two GROUPS OF Cou 
TTENDANCE AS THE Basis 


* 
EGE STUDENTS, 


Tan 
LE III.—AcaApEMIC RECORD 
wita TprMs OF A 


Cent of |Per Cent 
rm: 


tol Per Cent of |Per Cent of 
D apg? 


Terms in Per on el 
Terms on (Disqualified 


Differential Score Attendance | probation 
ro 
Mex] hana ees 
Below 6 1072 1.5 0.1 11.4 1.7 
Above 40 1372 9.5 1.2 22.5 5.8 


All criti x 
critical ratios are greater than 3- 


et 
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From this table, it is seen that the uneven students were even 
worse than they seemed in Table I. It is surprising that the num- 
ber of graduates is nearly the same for the even and the uneven 
groups. However, the uneven students often spent more than the 
usual twelve terms in order to overcome their deficiencies, and 
finally earn the degree. 

Another way in which the two kinds of students differed was in 
the variation of grades received. The uneven students who received 
“D” at least once, also received "A's"; and the number of “A’s” 
was as great as for the even students. Of the even students, forty- 
four per cent received at least one “A,” while forty-eight per cent 
of the uneven students received at least one “A.” It was concluded 
that the poor academic records were not due to lack of ability- 

The two groups were also compared with respect to their total 
raw scores on the ACE. The result of this comparison is shown in 
Table IV. The data are given for each year separately, as different 
editions of the ACE test were used each year; in 1948 the 1946 
edition, in 1949 the 1949 edition. The differences in the mean 
Scores were insignificant. The conclusion is that the uneven stu- 
dents were substantially equal to the even students in gener 
mental ability. 

The large number of “D” and “F” grades were due to some othe 
factor, presumably to defective personality. 

To study another aspect of personality, all deans, counselors; 
and dormitory directors were asked to name the students who were 
regarded as queer or troublesome, or who had been involved iP 
disciplinary action, or who, after leaving college, were known ig 
have had serious difficulty. The psychosomatic cases all came from 
the writer’s own file. Psychotics were listed who had been referre 


os psychiatrist, or who were known to have been sent to a ment? 
hospital. 


TABLE IV.—Tnz Toran ACE Sconzs or Two Groups or STUDENTS 


Standard 
Year of Entrance Kind of Student N Mean Rd Eror of the 
a c mer onm | Je 
Y aot 99 115.8 11.1 2.8 
“ven 54 117.5 14.6 
ai Uneven 81 107.0 10.8 1.8 


Even 59 108.8 11.1 


N 
| 


D. T 
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TABLE V.—DirrERENTIAL Scores OF UNSTABLE PERSONALITIES 


Median {Per CentAbove 


Category N Differential | Median of All 
Score Freshmen 
Defective personality 78 35 96 
Stutterers 5 29 80 
Disqualified (non-academic) 5 43 100 
Psychosomatic cases 22 33 74 
Exeluded from practice teaching 12 31 83 
Suicide 12 31 92 
Murderer 1 34 — 
Frank psychosis 16 9 18 


The list included students with psychosomatic complaints, gen- 
eral lack of adaptability, stealing, homosexuality, poor discipline, 
Suicides, and one murder. Records below the thirtieth percentile 
on the ACE test were disregarded, as it was desired to exclude 
those whose difficulties were complicated by mediocre intelligence. 

The differential scores of these people are shown in Table V. 
Ta general, the differentials are very high. It should be recalled 
that the median differential score is 16 for freshmen. For each 
category, the per cent of cases who exceed this, and therefore, can 

€ regarded as abnormal, is given. These percentages are also high. 


The list į i i Jete. It depends in part on mem- 
st is admittedly incomp!e P However, nyohefyae 


Ories, going bae] teen or twenty years. A 
Ineluded Shas pont not adequate, showing good evidence 
of personality defect. Those excluded from practice teaching ve 
excessively timid, or extremely rigid and severe, or had some other 
efect. Academic deficiency was not the cause of exclusion. 
The finding for frank psychosis is surprising. These students 


Were commi itals, or were known to have con- 
mitted to mental hospitals, ares Jed mo-ipport 


Sulted a nie : tial sc 

psychiatrist. The low differentia! sco y 1 
to the usual statement that psychosis is very like neurosis, only in 
More Severe form. 

CONCLUSION 

„The conclusion is that the psychological examination, often used 
with college freshmen, is a fairly good prognostic device. It is 
Possible to use the examination to Say which freshmen will have 
Personality defects, or neurotic and unstable personalities, but it 


i : 
8 Useless as a predictor of psychotics. 
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Some theory which accounts for these facts is called for. Prob- 
ably the best one is that the stable, well-balanced freshman at- 
tacks any task with equal zest and does his best on all tests. The 
unstable freshman, on the other hand, will do his best on one, but 
will be upset on another, resulting in uneven scores. The uneven- 
ness is therefore a sign of his total personality. 


SEX AS A VARIABLE 
IN INTELLIGEN 
PERFORMANCE ei 


MORONI H. BROWN and G. ELIZABETH BRYAN 


University of Utah 


Sex differ Me z 

B ah dels ripe intelligence test performance have been dem- 
(1,2, 6, 8.9 "n Mendes (10, 18, 15) and in later studies 
fallipenos 9nd 1, 12) of the three most widely used individual in- 
the Wechsler feat Stanford-Binet, the Wechsler-Bellevue and 
stp have r Intelligence Scale for Children (WISC). The differ- 
scales i gig de vag though, in the construction of these 
titans ham En minimal sex differences were selected. Most 
when the 2 = hard-put to explain these differences, especially 
sults for ake igence scales are supposed to yield comparable re- 
(10) have aries females. As Seashore, Wesman, and Doppelt 
logically Bari , there seem to be three major possibilities which 
teste in the s mg for sex differences. These are: (1) that the 
Performance or es are unbiased and one sex actually is superior in 
ability send E the tests; (2) that the sexes are equal in mental 
in TEA al e chosen test items turned out to be slightly biased 
iden one sex; or (3) again assuming that general ability is 

X differentiated, that the sampling of one of the sexes was 


SO. 

i chosen with a slight bias. / 
Sex di a pa to the rigorous avoidance of actually hypothesizing 
Studies " ences in intelligence test performances, the published 
conus, ten carry a small apologetic sentence tucked away 1n the 

ion section to the effect that the observed differences were 

r to test bias. . 
individual intelligence scales, the 
ifferences in the 
nent tests which 
in the 1937 revision of the 


era] ^ 
en which yielded the largest 

included i; d that a considerable numDe © 

ferences in the revised scales showed statis 

iferences the percentages of success for males and females. These 

combin ^à are supposed to largely cancel out when the tests are 

ed, but even so, on the Full Scale, the authors reported a 
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two-point superiority for boys between the ages of six and thirteen 
years. Above the age of fourteen, there was a four-point superiority 
for boys. These sex differences were obtained on their standardiza- 
tion group of two thousand four hundred and forty-four subjects 
ranging from five to eighteen years of age. Their comment oD 
the observed sex differences was that there was a possibility that 
some of these differences were due to biased sampling. 

Terman and Tyler (12, p. 1068) report a study of Scottish school 
children with the 1937 Stanford-Binet Scale in which sex differ- 
ences of 3.24 IQ points were obtained in favor of the males. The 
comment on these results was that the direction and magnitude 
of sex differences in intelligence is primarily dependent upon Lal 
test used. In other words, it was due to a test bias. 

Seashore et al. (10) in an article on the standardization of the 
Wechsler Intelligence Scale for Children, found that girls excelle 
boys at the ages of five, six, seven, and nine on the Verbal, Pen 
formance, and Full Scale IQ scores, but that at the age of eig? 
and from the ages of ten to fifteen, boys excelled girls on all thre? 
IQ scores. The greatest differences were in the Verbal scores at t A 
age of eight and from ten to fifteen where the boys' superior! 
ranged from three to five IQ points. Although the significance en 
these sex differences was not given in the article, they were rn 
puted by the present writers and some of them were found to 
statistically significant (0.05 level). . b 

Wechsler (15) stated that there were no statistically significa?” 
sex differences in total scores for the group on which the Wechsler 
Bellevue Scale was standardized, but that women tended to [an 
higher mean total scores at almost every year level. This state” 
ment seems to imply that there might have been statistically ud 
nificant differences on some or all of the tests in the Scale. There. 
some question about what Wechsler was referring to when he sa t 
that “these differences in themselves do not prove anything» » 
taken together with other evidence, have led this writer (that p 
Eee to the belief that it may be possible to demonstr: we 
ener età of women over men so far as general ^h ye 
aes neerned.” He then went on to say that he did not ven 

sary data as yet to back up this generalization (e 


though he had ; trib 
Just m “ i ” had contr 
uted to his belief) entioned that “other evidence" ha 


Despi " z à o 
espite Wechsler’s attempt to construct a scale in which ? 


-—————— eee 
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total ; 
reported e would obtain, such differences have been 
cally signifi er studies. Strange and Palmer (11) noted statisti- 
{hits ast ae differences favoring males on nine of the sub- 
the test record: Age IQ scores. In their study, these authors used 
ninety femal e a group of one hundred and forty-five male and 
in the earl tt à ult psychiatric out-patients whose mean age Was 
superiority o : airties. Contrary to Wechsler's statement about the 
showed highe Osan intelligence, the Strange and Palmer study 
level for Ve b Lan es for males which were significant at the 0.001 
forono IQ a = Full Seale IQ's and at the 0.02 level for Per- 
ploratory aat EM writers mentioned that theirs was an ex- 
Served ina s y, but even so the sex differences which were ob- 
sex differen ample as large as theirs should serve as a warning that | 
dhele Hell. should be checked in further studies which use the 
Metas evue Scale since such differences might change the 
Followin n of the results. 
Goolishian e Strange and Palmer report on sex differences, 
hundred domi oster (6) used a collection of test records of three 
and ninety-two adult psychiatric male and female pa- 


tients 
Whose mean age was about thirty years. The results showed 
or better favoring 


Sex di e 
supo DM significant at the 0.05 level 

On five of the subtests and on all three IQ's. The sex differ- 

0.001 level, the Per- 


publie sc 
f each sex at five ag 


and ! 
Don-referred subjects from a 
1l five age levels favoring males 


tio 
nin cin were forty subjects 0 
9n the F du sex differences at à ring 
OWeve H Seale weighted scores but none of them was significant. 
an di "i on the Verbal Scale, males had higher weighted scores 
nificant females and at three age levels the differences were Sig- 
differ at or beyond the 0.05 level. There were less pronounced 
ade, deen on the Performance Scale but at four of the five age 
males, Me subjects had weighted scores equal to or higher than 
- The two oldest age ranges showed sex differences significant 


att 
eed and 0.05 levels, respectively. . 
clle ugh the above mentioned studies using the Wechsler- 
vue all indicated sex differences generally favoring males, 
bjects in these studies were 


it sh 
10uld be noted that the samples of su 
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not similar. The first two studies used the test records of psychi- 
atric subjects exclusively while the third study included only non- 
clinic and presumably “normal” subjects. In addition, the first 
two studies used unequal numbers of male and female subjects. 
More importantly, only the mean ages were reported for the psy~ 
chiatric groups and no information regarding the age ranges OT 
age distributions was given. This is a serious limitation since 12° 
telligence test performance has been shown to vary with agè 

so that one cannot assume constancy at different age levels. 

To the extent that the Stanford-Binet, the Weehsler-Bellevues 
and the WISC are valid measures of intelligence, the studies here 
quoted all indicate that there are sex differences in the levels ° 
over-all intellectual functioning. However, it seems that à pertine? 
question in regard to sex differences concerns the rationale for 2 j 
tempting to eliminate differences in performance between the sexes 
on intelligence tests when such differences are readily accepte m 
areas which deal with physiological and emotional reactions, yoce 
tional aptitudes, interest and personality inventories, etc. Some pow 
the possibility of sex differences in intelligence test performance 
seems to have been an unacceptable hypothesis. This resistance ê d 
parently is related in part to a conviction that sex difference 
could not exist because this would imply that one sex is supe}? 
to the other. Consequently, the explanations of any demonstr®; 
sex differences usually have incorporated either test or sampli”! 3 
bias as the major contributing factor. In contradiction to ghi 


ences may exist and, particularly that they do, in fact, exist 58 
the currently used intelligence scales so that intelligence 35 ur. 
measured is not equal for the sexes regardless of whatever in here? 
differences may or may not be present. If the “amount” 0” ge 
telligence is not the issue but rather the "kind of intelligent 
test performance,” then it should be possible to retain a tes 
^ scale as a useful measuring tool as long as male and fe 
kelly. 5 alls ished for that scale. It might be noted, incide 
there ak i) m the Cornell-Coxe Performance Ability Scale ale 
and female ^s NS of smoothed IQ scores for the combine! 
separate t ^ andardization subjects at different age levels, 
ables of norms for male and female subjects are pr 


sented and the jn 
i y show i les to obt? 
higher scores at all Ue D o tendency for males 
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Perha indivi 
dir — = idual scales which are being used at the present 
onai a |i ees if, in their standardization, the com- 
had been siste pe different results for males and females 
formance of ^ . Inasmuch as the differences between the per- 
ee with age, it would 
they were iem standardization norms would be more valuable if 
is D gl qm for discrete sex-by-age groups. The individual 
jnbétadt and "e and sex is not sufficient since both variables 
sistency of th heir relationships are not constant. This incon- 
strated by e bor d sex variables has been indirectly demon- 
studies which I ack of agreement among the results of various 
8,9, 18, and 1 nave used samples of different age ranges (8, 4, 7; 
Wadhsls ee d . Direct evidence for age-by-sex interaction on the 
not only did i has been reported (1) and it was shown that 
re eek i Performance, and Full Scale scores vary 
of the soiseidind ets ages but also that the weighted scores 
s M üt subtests varied in their rank order from one age 
Rabe and allowing for sex differences, it might be 
more B a psychometric device which would prove to be 
As it is ae to unusual or pathological intellectual functioning. 
sex di fiere ever, no such general intelligence scale exists. Since 
recogniti nees are present in the currently used scales, then the 
onto th on of these differences at least should throw some doubt 
abilities, conservative and glib generalization that intellectual 
are equally distributed between the sexes. 
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PURPOSE OF THE STUDY 


Previous research has frequently revealed that various lin- 


guistic skills such as vocabulary, reading comprehension, and 
h is related to scho- 


IM ing are positively related and that eac 

astic achievement, Previous research has also revealed that each 
of these linguistic skills is positively related to intelligence of 
the symbolic type as measured by the usual group intelligence 
test, However, little research has been forthcoming concerning 
ss degree of relationship remaining among these linguistic skills 
I between them and scholastic achievement when the intel- 
‘gence factor with which each appears to have some degree of 


is turation is ruled constant experimentally or statistically- This l 
the problem investigated in the present research. . 
s s to identify the following 


r vA cifically, the present study seek 
elationships: 
(1) The coefficients of correlation between objective measures 
9! Vocabulary, speed of paragraph comprehension, and spelling. 
v. The coefficients of correlation between intelligence scores 
prel Objective measures of vocabulary, speed of paragraph com- 
Jension, and spelling. . 
(3) "The coefficients of correlation between college grade point 
279 
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averages and objective measures of vocabulary, speed of para- 
graph comprehension, spelling, and intelligence. 

(4) Partial coefficients of correlation between each of the fol- 
lowing with intelligence ruled constant: vocabulary and speed of 
paragraph comprehension, spelling and speed of paragraph com- 
prehension, and spelling and vocabulary. 

(5) Partial coefficients of correlation between each of the fol- 
lowing and college grade point averages with intelligence rule 
constant: vocabulary, speed of paragraph comprehension, an 
spelling. 


PF Uy e 


MEASUREMENT OF VARIABLES f 


The Nelson-Denny Reading Test, Form A, yields objective 
measures of both vocabulary and speed of paragraph compre 
hension. These scores were used as measures of these abilities in 
the present research. The spelling test of the Differential Aptitude 
Tests was used as a measure of ability in spelling. Total scores 
on the 1952 edition of the American Council on Education Psy- 
chological Examination were used as measures of intelligent" 
Each of these tests was administered as a part of the freshm? 
testing program at Panhandle Agricultural and Mechanical ix 
lege. Grade point averages used as criteria in this investigatio" 
were first semester averages. The subjects for the present 2 
vestigation numbered one hundred seventy six and represen ol 
the entire 1955-56 freshman class of the above college who T 
mained for one semester and for whom complete test resu 
were available. 


RESULTS OF THE STUDY 


In attacking the problem, the first step was that of ascertain 
ing the relationships present among the three linguistic £ r- 
with which the study was concerned. These coefficients of co 
relation are shown in Table I. 

The coefficient of correlation between vocabulary and spee 


d of 


Taste L—RzLATIONSUIP Between Pains or Linaurstrc SKILLS 


Pairs of Skills — 
Voeabul 
Bellini sud one Speed of paragraph comprehension 0.621 
Speed of paragraph comprehension 0.408 


Spelling and vocabulary 0.668 
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paragraph comprehension was 0.621, indicating a substantial 
ars relationship. The correlation between vocabulary and 
do - was 0.668, also indicating a substantial positive relation- 
ihe 6 Eee spelling and speed of paragraph comprehension 
lionis T] on was 0.408, indicating a moderately strong rela- 
E a. hus, the present results agree with those usually found 
nite Uar studies in that these three linguistic skills exhibit defi- 
ee mare interrelationships. Under such conditions one may 
a vorn d hypothesize the possibility that at least a part of 
cei errelationship may be due to the saturation of each with 
cina eie factor, and proceed to test such a hypothesis. The 
Fate study hypothesizes that intelligence is such a common 
Es : present in each of these separate abilities and seeks to 
the See what relationships between these skills remain when 
the Influence of this common factor is removed. Table IT shows 
$ relationships found between each of the three skills and in- 
elligence, 
ros the data in Table II it is seen that each of the three lin- 
Era measured is strongly related to intelligence, the co- 
telli nts of correlation being 0.708, 0.659, and 0.573 between in- 

gence and vocabulary, speed of paragraph comprehension, and 
5p elling, respectively. These data seem to support the premise that 
Intelligence may be a common factor accounting at least partially 
for the strong relationship existing among the three variables. The 
next logical step in the testing of such a hypothesis is the partial- 
ing out of that portion of relationship between pairs of linguistic 
skills due to the common factor of intelligence and identifying the 
degree of relationship remaining. The results of this process are 
shown in Table III in the form of partial r's. 

he correlation coefficients shown in the above table are those 

etween pairs of linguistic skills with intelligence ruled constant 

Statistically, By referring to both Table I and Table III one may 
Observe the decrements in degree of relationship between these 


TABLE II.—RELATIONSHIPS BETWEEN LINGUISTIC SKILLS AND 


INTELLIGENCE 
r 


Linguistic Skill 
_—————— 
0.708 


0.659 


Vocabulary E 
hension 
0.573 


Speed of paragraph compre 
Spelling 
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Taste III.—RELATIONSHIPS BETWEEN Pairs OF Linguistic SKILLS 
WITH INTELLIGENCE RULED CONSTANT 


Pairs of Skills | Partial r 
Vocabulary and speed of paragraph comprehension 0.290 
Spelling and speed of paragraph comprehension 0.049 
Spelling and vocabulary 0.452 


pairs of skills when the influence of intelligence is removed. Thus, 
the correlation between vocabulary and paragraph comprehension 
drops from an original 0.621 to a modest 0.290. The correlation 
between spelling and speed of paragraph comprehension drops 
from an original 0.408 to a statistically insignificant value of 0.0 
The correlation between spelling and vocabulary dropped from ? 
strong 0.668 to a moderate value of 0.452. oo 
It is seen, then, that the removal of the common factor of a 
telligence very definitely reduces the relationships among we 
linguistic skills. The relationship affected most by this remov s 
appeared to be that between spelling and speed of paragraph cor 
prehension, this relationship apparently being almost em 
accreditable to the relationship of both factors with the po 
variable of intelligence. Also considerably affected was the re 
tionship between vocabulary and speed of comprehension. It po s 
appear that much of the strength of the relationship between the n 
two abilities is attributable to intelligence since the correlatio 
coefficient is considerably reduced by its removal. However; i 
of the relationship between these two variables is not due i 
saturation of both with the intelligence factor since there remal 
a definite positive relationship when its influence is removed. be- 
observed decrement in degree of relationship is not $0 great her 
tween spelling and vocabulary as those observed among ot iie 
pairs of linguistic skills when intelligence is ruled constant statis 2 
cally. Although there is a reduction in the coefficient of correlati " 
when intelligence is partialled out there remains à fairly SU 
stantial relationship. n 
ite ee further concerned with the relationship n 
Temang bet scholastic achievement and with the t M scho" 
luda acidve ween each of the three linguistic skills Es x Lond 
stant. Tabl T e when the variable of intelligence is rule E 
€ IV shows the relationship found between each 0 
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TABLE IV.—RELATIONSHIP OF ALL VARIABLES WITH 
COLLEGE GRADE AVERAGE 


r 


Variable 
—" 
Vocabulary 0.513 
Speed of paragraph comprehension 0.433 
Spelling 0.557 
0.518 


Intelligence 


— variables and college success in terms of grade point aver- 
E jeter r in Table IV it may be see 
Poss, "The En bs "s es with scholastic 
ACE ndo: hip | etween intelligence as measured by the 
eius cholarship is similar to that usually found by other re- 
EA Those coefficients of vocabulary and spelling with grade 
Émitol ds surprisingly high considering both the brevity and the 
iene ae of the tests of these abilities in comparison with the 
When d more comprehensive ACE. 
iilis and hat part of the relationship b 
pare scholarship which is attributable to | 
tained queue was partialled out, the data in 
of t he degree of remaining relationship is S 
partial r's, 
NL partial r of 0.144 remaining between speed of paragraph 
ai pnp and scholastic average with intelligence ruled con- 
i is not statistically significant. The partial r's between vo- 
enol and spelling and grade average are significant, at the 
tion level of confidence. In all cases there 15 à decrement in rela- 
wi ship observed between the linguistic skills and scholarship 
i en the influence of intelligence is removed. However, there re- 
ans a definite positive relationship for both spelling and vocabu- 


l n 
ary with scholastic success. 


Tape V.—Rexationsmr BETW 
AND GRADE AVERAGE WITH INTELL 


n that all of the vari- 


etween the three linguistic 
le to the common variable 
Table V were ob- 
hown in the form 


EEN LINGUISTIC SKILLS 
IGENCE RULED 


CONSTANT 

Dan 
Vocabulary 0.245 
ehension 0.144 
0.373 


Speed of paragraph compr 
Spelling 
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SUMMARY AND CONCLUSIONS 


From the results of the present study one may safely make the 
following generalizations: 

(1) The linguistic skills of vocabulary, paragraph compre- 
hension, and spelling are related substantially with one another, 
and all are related substantially with intelligence. Each of these 
variables is also related substantially with scholastic success. 

(2) Much of the strength of the relationships present among 
these linguistic skills is due to a common saturation with intelli- 
gence. Much of the strength of the relationships present betwee” 
each of these linguistic skills and scholarship is due to their com 
mon saturation with intelligence. 2 

(3) Although the removal of the influence of intelligence m 
duces the relationship between vocabulary and paragraph egi 
hension, there remains a relationship sufficiently strong to justify 
attempts to improve reading comprehension through vocabulary 
training. " 

(4) There is no significant relationship between speed of pera 
graph comprehension and spelling ability when the influent y 
intelligence is removed. This would seem to indicate that the § d 
reader is not necessarily a better speller than the fast reader 
equal intelligence. me- 

(5) Although the removal of the influence of intelligence wo 
what reduces the relationship between vocabulary and ga E 
ability, there remains a fairly strong relationship between th 
skills. an 

(6) There is no significant relationship between speed of ri 
graph comprehension and scholarship when the influence ofin in 
gence is removed. This would seem to indicate that programe ip 
speeded reading can do little to influence the quality of scholars i 
although they may serve to reduce considerably the time spen 
acquiring a given scholastic proficiency through reading. P 

(7) Although the removal of the influence of intelligence i 
duces the relationship of vocabulary and spelling with schola à 
meh there remains a significant relationship we y 
ing ion Se p ship. This would seem to Mes port e 

ic o 
programs in re concentrate on vocabulary develop 


medial spelling offer definite possibilities for im 
provement of genera] scholarship. 


; f 
(8) Research which goes no further than the computation 9 
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zero order relationships between symbolic skills is incomplete and 
may actually be misleading. Future research in this area must 
recognize the possibility of common saturations and explore these 
possibilities in order to be scientifically complete. 
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AN EVALUATION OF TWO TYPES OF 
KINDERGARTEN ATTENDANCE 
PROGRAMS! 


EVA DREIKURS FERGUSON 


Northwestern University 


Tn line with the growing trend in preschool education to decrease 
veek, the 


the number of hours of kindergarten attendance per W 
center in which the present study was made established a program 
of spaced attendance modified from a suggestion made by Gesell 
and his co-workers (2). The Yale group suggested that the assimi- 
lation of the newly learned skills gained through a kindergarten 
experience was provided to a greater extent by a spaced than by 
a continuous attendance program, especially for the three-year-old. 
Additional advantages allowed by spaced attendance were that 
ìt provided opportunity for mother and child to have outings 
together on the days the child was home, and that it would be less 
atiguing for the child to have the extra days of nonattendance. 
The modified spaced attendance initiated at the center in which 
this study was made consisted of five mornings of attendance every 


ds weeks, the children attending three days one week and two 
am ran successfully for 


days the next week. Although this prog" t 
More than two years, other preschool centers in Australia were | 
reluctant to follow in this venture until it could be adequately 
demonstrated that spaced attendance was not depriving the child 
of the social and psychological development that was currently 
Provided by daily attendance. 
of E» nera of the present stu 
Ppa gram 
able dum MiA Be na between daily and spaced attend- 
ance, with regard to the children’s social, psychological and physical 
evelopment. Specifically, the two attendance programs were 
— 


therefore an evaluation 


dy was i 
A hether any notice- 


by testing W: 


7 - 
* The present study was made in partial fulfillment for the moe = 
gree in the Psychology Department, Melbourne University. The ata E 
Sathered at the Lady Gowrie Child Center in Melbourne, Australia. | e> 
Cause no uniform terminology exists in the literature, the terms “‘preschoo 
Center” and “kindergarten” are used interchangeably in the present study. 
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compared on the following points: (a) the individual child's be- 
havior, as displayed in free-play periods; (b) social cohesion © 
the total peer group; (c) physical development of the individual 
child; (d) attendance records and reasons for absences; (e) the 
individual child's behavior and development at home, as reporte 

by the mother. The study covered a six-month period, from March 
to August, which included the fall and winter seasons in Australia. 


METHOD 


Groups 


The children used in the study were between the ages of three 
and five. They came from a homogencous, working-class neighbor 
hood, with over seventy per cent of the parents having eig? 
years or less of education. (Since in Melbourne the age limit for 
compulsory education is fourteen years, this sample is typical ° 
the Melbourne working class.) Fifty-six children were select? d 
from the available kindergarten population of the center. 
majority of them (forty out of the fifty-six) had attende nil 
center the previous year. Two groups, each of twenty-eight 9 8 
dren, were formed by matching Ss in pairs on the basis 9 
following variables. 

(1) Age, as measured in half-year units. of 

(2) Type of family background. This included: (a) the ty p ne 
discipline and its degree of consistency used on the child 1? 
home; (b) the degree of manageability of the child at hong 
amount of family harmony; (d) sibling status, measured acco" se, 
to whether the child was an only child, had siblings close im ^ 
or had siblings six or more years older than himself. 

(3) Adjustment to peers and to kindergarten routi 
end of the previous kindergarten year. tailed 

The data on the family background was obtained from de 
records which are routinely kept for all the children at the a e 
by a social worker. Those children who had attended the 68 Jes 
the previous year were rated on a 3-point scale on the We 
bi ie a under home background. The reliability of the va ding 
Peck eg med against the director's judge sod on 
personal cont; ome background, ber judjntess Lae E reliabilit 
din found acts and home visits with the family. er cont 

to be high, agreement being above eighty P 


ne at the 
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vide qe to peers and to kindergarten was based on a rating 
(iiim y the teachers. At the end of the previous year, the 
ML Pur ed all the children at the center who were to continue 
ae e kindergarten for the year of the study? 
Sese ova who had not attended the center the previous year 
co dem in only on age and sex, as no adequate information 
sepia a background was available. As far as possible, Ss 
Men e ; igne randomly to the two groups, although completely 
in i-r p nm was not possible. (In cases of siblings it was 
Mer ae * D S as it was desirable to have siblings in different groups, 
ite diim ew cases where matching of the pair was not complete, 
dien ren were so assigned as to keep the distribution of the 
atching variables equal in the two groups.) After the groups were 


for s 
ormed, the director of the kindergarten, without knowledge of 
p, arbitrarily designated one 


ated. The two groups 
for spaced attend- 


a 
nce, and the D-A group, for daily attendance. The 8-A group 


ew Tuesdays and Fridays and alternating Wednesdays; 
both -A group attended every day. The hours of attendance for 
ni groups were nine to twelve for the first three months and 
pie aad to twelve-thirty for the last three months, the latter 
i urs being part of the center policy to allow the children to sleep 
Ohger on the colder and darker winter mornings. 

Two teachers were used in each group, as was the custom at the 


center. These were matched for length and type of teaching ex- 
vere comparable in facilities, and Ss 


ree The rooms Ss used W à 

is ed a common playground and playground equipment. Teach- 

di programs and methods of handling the children were syn- 

h ronized in conference with the four teachers, so that the children 
ad comparable teaching experiences: 

ecause of unavoidable loss of Ss during the course of the study, 


ero plete information regarding the matching procedure of the’ two 
Hide can be found in the original Master's thesis. The matching was under- 
Dew Merely to safeguard comparability of the two groups at the onset of 
la "dy. The data indicate the matching to have been effective (see Ta- 
Since a significant difference between the groups, 1n March, was found 


fo à 
T only one item in the behavioral observations. 
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the final number of matched pairs was twenty instead of the 
original twenty-eight. Some Ss remained in spite of the matched 
partner dropping out, however, and these Ss were included in some 
of the data. Thus, although there were only twenty pairs, the 
total number of Ss in the two groups was forty-eight. The losses 
were spread fairly evenly throughout the age and sex categories 
so that the final age and sex distribution of the twenty matehe 
pairs was almost comparable to the original group of twenty-eight- 
Of the final twenty pairs, nine were boys’ pairs and eleven were 
girls’, the mean age for both sexes, in March, being four ye 
one month. This is in contrast to the original group of fou 
boys’ and fourteen girls’ pairs whose mean age was three years 
eleven months. 


ars an 
irteen 
an 


Types of Measurements Used 


. " : ; -orvation 
Behavioral observations. To assess Ss’ behavior, an observ? 


sheet with seventeen categories was devised. Figure 1 n 
some of the major categories and a few illustrative items." Eac e 5 
was observed by the time-sampling technique for three p " 
twice a week, during free-play periods. The S-A group was e 
served on Tuesdays and Fridays, the D-A group on Mondays p 
Thursdays. Ss were observed in random order, the matched Pp 
ners being observed in the same relative order within their resp f 
tive group. Different random lists were used for the six months 
the study, so that only by chance would a given S be observe 
the same time on several observation days. The three minute S 
servation period was divided into six sections of thirty "n 
each. The behavior shown by S at the time of observation was 


te- 
recorded by tallies in the appropriate categories.‘ In some € 


z ay 
? The complete Behavior Observation Categories Instrument T" thé 
obtained on written request to the author. For a detailed accoun ri inal 
categories and of their operational use the reader is referred to the ? 
Master’s thesis. 


4 n 
. Although there was no reliability check on the observations, an, 
in observation th. 


y bi 


0, since the data were not analyzed until t giat 
at during the course of the observations the igen bility 
ch way the results were going. The reason no re igor" 
training requi was that no second observer was available for the yet lack 
ofa relisbilit ‘red for the use of the observation sheet. Admittedly, M this 
study. Y check on the observations is a methodological weakness 
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FIGURE 1 


(1) Type of activity 
(a) Aimless or idle preoccupation 
(b) Looking on at some activity 
(c) Solitary activity 
(d) Parallel activity 
(e) Associative play activity 
(f) Coüperative play activity 
(g) Contact with adult 
(2) Amount of serious absorption (constructiveness) during play 
(a) Passive sitting or onlooking 
(c) Mostly conversing rather than playing 
(f) Serious absorption and oblivion to surroundings 
(3) Verbal communication during play 
(a) Primitive verbal usage 
(d) Talking to self or making noises 
(k) Arguing 
(4) Degree of follower-leadership shown toward peers 
Soaking or asking for help 
e) Leading an nizing others 
(7) Aggression Naar eine 1. ne immed. contact; 2. act of offense; 3. act 
of defense. 
(f) Disrupting group or solitary play 
(9) Crying j 
(b) Wh i vise attacked by another : 
(1) Tau E (With Imaginative Content and Rels-Playing) 
(4) (c) Very elaborate home situations 
Materials used ater realism) 


(c) Complex (improvising to achieve gre 
d illustrative items from the 


Franz 1. Some of the major categories an 
ehavior Observation categories instrument. 
i i ras shown pre- 
omi i i s, as for in 
,ommantly in the thirty seconds, 


i ile i ies a check was made for 
Type of activity," while in other categories righe rin 


the behavior r its duration in 
egardless of its dura d 
example in ndm ory 9, "Crying." Though — ie Saec 
Upon at a time, the names of 5s with whom he in 
noted on the side. 
he data used for analyzin 
9r each item, a S's mean score 


viding the total score for the month by à; 
Was observed in bat month. The possible range of scores for a 


BlVen observation day was 0 to 6, a score representing the oo 
of thirty-seeond intervals in which an item was checked. : 2o a 
Score of 5 meant the behavior was shown in five of the six thirty- 


‘i follows. 

the results were derived as | 
for the month was obtained by 
h by the number of days the S 
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second intervals. If S was present for only two observation days in 
the month, and on one day his score was 0 and on the other day 5, 
his mean score for that item for that month would be 2.5. Observa- 
tions were made about eight times a month, each group being 0b- 
served twice a week. Since some Ss were occasionally absent, the 
number of observation days for a given S for one month rang 
from two to eight days. 


* two sociometric tests were administered, one in late April and one 
at the end of July. The questions for each test were asked by ? 
volunteer worker, who asked each S whom he likes to play with 
three specified situations: in the sandpit; at clay; and with b 
(for the boys), or in the dolls’ corner (for the girls). with t 
choices in each situation, a maximum of nine different children 
could be named by $, though a lesser number was permitted. " 

Physical development. Measures of Ss' physical developme? 
were based on records kept by the staff nurse. She saw all childre? 
at the center routinely each morning for a brief checkup- In 00 3 
tion, they all had routine dental examinations throughout E 
period of the study, at which times the nurse was also presen m 
were also given a thorough medical examination by the staff doe E 
at both the beginning and the end of the study. On the beste 
these medical experiences, Ss were rated by the nurse on the follo 

ing variables: uv tO 
KSO) The child’s general health. This referred to his ability is 
withstand disease and excess fatigue, his ability to carry "X Ja- 

normal routine of activities, and his weight and body tone 1? i 

tion to his own body build. istles* 
(2) The child’s state of fatigue. This referred to chronic list ie 

ness and tiredness as manifested in the morning checkups 

nurse. 

(3) The child’s fear reactions to the medical sta 
examinations given by the doctor, dentist and nurse. , bei? 

The ratings were made on 3-point scales, each variable ng 
rue etia condition at the end of the study, and on the che 

ot condition from the beginning of the study. 

Absence records, A. sant of ‘absences aa kept by La sooty 
of the center. She noted the number of times each S was @ n det’ 
found out the reason for the absence, upon S's return ^ 


b 
TI 
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garten. Reasons for S's absence were analyzed at the end of the 
study, and these were categorized into the following: (a) illness or 
visit to a hospital or doctor; (b) outings, holidays or other celebra- 
tions; (c) family mishaps, which includes no one being available 
to bring S8 to the center a given morning; (d) miscellany, which 
includes mainly cases where no reasons were given or where vague 
statements were made from which no clear reason could be ascer- 
tained. 

Parents? questionnaires. To assess the parents’ evaluation of how 
Ss were progressing at home, two questionnaires were adminis- 
tered, one in April and one in August. In all cases the mother filled 
out the form. The questions on each occasion dealt with the parent’s 
opinions about the two types of attendance program, and about 
any changes noted in her child from the start of the study to the 
time of the questionnaire. On the first questionnaire there was an 
additional item, which was a request for the mother to keep à diary 
account for one week of the child’s activities at the times he was 
not at the center. This was included in the questionnaire for the 
Purpose of assessing whether the home activities differed for the 


two groups in spite of the precautionary matching. F 


Ne: three of the major variables, 
physical development and absences, 
checking for each of the twenty pairs which ae of 
larger score on the particular item 5 question. 

comparison were then tested for significance by use 
expansion, Since the concern in this study is only t? 


d 7 r score, the test for sig- 
tonne isa tee led pd oe es ex a significant difference 
, 


nificance is a two-tailed one. Thu : 
at the five eee level, fifteen out of à possible lisi p 
of one group would have to have larger scores plene icut 
Partners, this giving a significance level of 4.2 per € fen Wale 
figure to five per cent. The probabilities were ey mw 
and Lev (5, p. 458, Table IV B). Ties WP mud z 
them, e.g., if the ratio is fourteen to two members of one group 
having larger scores than thet Pre ee ren mh 
ve a rati i ur, signific . . 
In d aak ines is all Ss in the groups were used, not 
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just the twenty pairs of matched partners. This was necessary 
since the group structure involved all the children within the group» 
Ss obviously not being aware of who was and who was not in the 
final set of matched partners. (They were not even aware that they 
were so matched. All they knew was that every member of their 
group was observed on a given day by the investigator. Within the 
first few weeks they became quite oblivious even to this.) The 
method used to arrive at the group structure was a matrix analys E 
suggested by Forsyth and Katz (1). The manipulation of the matrix 
was achieved by use of the Matrix Analyzer, devised by s. B. Ham- 
mond of the Department of Psychology at Melbourne Univers 

As with the sociometrie data, analysis of the parents’ question” 
naires was also not based on comparisons of the twenty matehe 
pairs. This was necessitated by the fact that not all the parents ° 
the forty children returned the form. The analysis was thus base 
on all the questionnaires returned to the investigator, which 12° 
cludes some 8s who were not in the matched pairs and exclude? 
some Ss who were in the final matching. The data thus cover pom 
eight-five per cent of all Ss in the groups. (Inspection of the oe 
indicates that even if the fifteen per cent for whom no forms w 3 
returned were extreme deviants, the results would have been 9 
stantially the same.) 


RESULTS 


Behavioral Observations - 
; ; h 
It will be remembered that Ss were observed for SIX mont E 


from March to August. Comparison of the two groups on this V e 
iable was made for both July and August, since inspection © ere 
data showed that trends of change in the six-month period Pont 
not uniform. By taking two months as the terminal point mt 
which to measure the change that occurred since the first di 
m ed a more reliable criterion is established for eyalu® two 
vn ant differences found in the development 9 

ent Ais d over one hundred items, which mado e 
between the apad only five items showed a significant i differ 
ence in onl © groups. Four of these showed a significan owed 

Y one of the terminal months, and one other item 9» 


a significant change for both J uly and August, this item being “oblig 
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ingly following a peer request.” The nature of this significant de- 
crease was that the S-A group had shown this item significantly 
me in March (at the 0.7 per cent level), and by the end of the 
study the S-A group had decreased in this item to the low incidence 
consistently displayed from March to August by the D-A group. 
All significant differences found between the two groups on the 


Kr x observations are found in Table I. 
rom Table I it can be seen that by the end of the study the 


S-A group had more parallel play and showed more coöperative 
acts in relation to aggressive acts than did the D-A group. But the 
S-A group also had more distractible behavior, and it did not keep 
its higher standing in the item of obligingly following peer requests. 
In contrast, the D-A group had made a greater improvement in 
relation to the S-A group by decreasing in aimless or idle preoccu- 
pation. However, although the differences were significant, except 


ES BETWEEN THE Two 


TABLE I—SroNrricANT DIFFERENC 
RVATIONS 


GROUPS IN THE BEHAVIORAL OBSE 


Items March July August 
a ee 
Parallel play (1d) S-A > D-Af 
Tli (P < 0.03) 
llitting about and/or bei BA. > DAE 
silly (2b) fat Penne (P < 0.03) 
wv 
= Cooperative acts to S-A > D-Af 
peers/Z aggressive acts to (P < 0.01) 
peers (Z category 6/Z cate- 
o Bory 7) 
bligingly following a peer S-A > M 
quest, (Sdl P < 0.01 , 
SUM in  — fol- s S-A decr'd 7 an > 
Owing a t > DA ^ 
(from March) — me (œ < 0.01) | (P < 0.05) 
ns in aimless or idle D-A oe 
reoccupati > 5- 
pation (la) (from p= 0.002) 


of the above three months 


| * Although i e item on any 
did not differ La c = b two groups» the groups nevertheless re- 
eed. their standing on the item. This reversal gave a significantly greater 
ecrease from March to July for the D-A group- 
í An advantage for the S-A group- 
§ A disadvantage for the S-A grouP- 
An advantage for the D-A group: 
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for one item they did not appear in both terminal months, thus 
indicating a trend only and not a definitive change. Moreover, M 
over a hundred comparisons made for each terminal month, five 
items can be expected to show a significant difference by chance 
alone. In view of these considerations, the results must be taken to 
indicate that the groups did not basically differ by the end of the 
study, since the points of comparability significantly outweighe 
the points of difference. 


Sociometric Data 


In group structure some differences (obtained by inspection A 
the sociomatrices) appeared between the groups, and these differ 
ences were most apparent in the second test. On the first sociome Á 
ric test (end of April) the S-A group was composed of several clique? 
which were fairly well integrated with one another. The D-A group 
in contrast had one quite large and well-integrated subgroup a 
posed of several cliques, with a few small cliques fairly independe” 
from this subgroup. On the whole, neither group was very T": 
integrated, and both groups tended to show unisexual preference 
The groups were comparable in their percentage of isolates 27 
unreciprocated choices. In degree of integration of the total gn 
the 8-A group was slightly superior. At 

A marked change occurred by the second test (end of July) it 
this time both groups became considerably more integrated» < A 
their method of achieving integration differed markedly. Tho 
group formed two quite independent subgroups, which were nm 
posed entirely of unisexual friendships. In contrast, the D-A LEE ] 
expanded its earlier subgroup to include two-thirds of the mem uch 
The cohesion between these members was very strong; with m 
exchange of mutual choices. This large subgroup was bisexuñ „knit 
rest of the group structure was made up of à small, Joosel¥ e 
cluster of the older boys, and of a few isolates. Although the d 2? 
group had by far the greater integration, neither group achiev third 
ideal group structure, in that the D-A group had nearly Gas 
of its members peripheral to the tightly-knit core, and in had z 
S-A group had too little overlap between its cliques and 1 
marked cleavage between the two sexes. oint$ in 

In spite of their differences, the two groups had — witbiP 
common. One was a marked social mobility of the members 
each group, from the first to the second test. A tau (3) 0 
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obtained for the D-A group and a (au of 0.34 was obtained for the 
S-A group, these showing a marked change in the social rank of 
the members within each group. A further point of similarity was 
that with the increase of integration in the two groups, the choices 
made by Ss became more “realistic.” To assess the validity of the 
choices (i.e., the degree of “reality” of the choices), the choices 
were compared with the names of the children with whom each 8 
interacted during his observation period, at the time the sociometric 
Pas were made. The mean ratio of “unrealistic” choices dropped 
rom the first to the second test from 0.51 to 0.29 for the D-A group, 
and from 0.42 to 0.31 for the S-A group. 

In summary, differences between the groups appeared in the 
type and degree of integration which each group evolved. The dif- 
ference increased with time, and at the end of the study the D-A 
group was found to be the better integrated group. In spite of these 
differences, the groups were alike in showing marked social mobility 
and more realistic choices from the first to the second test. 


Physical Development 


In physical development the two groups were nearly identical. 
Tn both groups the majority of Ss (seventy-five per cent) had im- 
proved in health from March to August, though the majority (sixty 
Der cent) did not change in their state of fatigue and in the amount 
of fear shown to the medical staff. In general, Ss were rated to be 
in good health with little or only moderate fatigue at the end of the 
study, in August, the majority of Ss also showing no fear reaction 
to the medical staff. No differen 


tial age effect was found, the over- 
all findings adequately representing both the younger and the 
older Ss. 


Absences 
Absences were compared for all six months of the study. In total 


number of absences the D-A group had nearly twice as many days 
r of days lost being fourteen 


lost as the S-A group, the mean numbe d 
Der S for the D-A group, 7-5 per S for the S-A group. Considering 
that the D-A program called for twice as many days of attendance, 
the obtained difference would be expected on an a priori basis. In 
the breakdown of “reasons for absence” the groups did not differ 
in any consistent way. For both groups the vast majority of ab- 
Sences were due to illness or visit to à hospital or doctor. The only 
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other sizable reason for absence was outings or family vacations, 
and this occurred in less than twenty per cent of the total number 
of days lost. 


Parents’ Questionnaires 


It will be recalled that eighty-five per cent of the parents an- 
swered the questionnaires; of these, eighty-eight per cent answered 
both forms. Analysis of the two questionnaires revealed that ninety- 
five per cent of the D-A and eighty-five per cent of the S-A parents 
preferred daily attendance. This strong a preference was expressed 
on both the first and the second questionnaires. Although the ma- 
jority of the parents who answered both forms did not give the 
same reasons on both forms for preferring daily attendance, the 
types of reasons mentioned most often remained quite similar. 
(The questionnaires were of the open-ended type, so that the re- 
sponses had to be grouped into categories after the forms were 
returned.) The reasons given the most number of times were: (a) 
the child enjoys kindergarten, in general; (b) the child enjoys the 
peer companionship at kindergarten; (c) home does not provide as 
satisfactory interests or activities; (d) kindergarten is a necessary 
preparation for school. These reasons all have the implicit assump- 
tion that, since kindergarten is a desirable place, there should be as 
much of it as possible. 

On the questionnaires the mean number of positive changes per 
child reported on both occasions for both groups was 1.6. The two 
groups were quite comparable in this, on both forms. Some differ- 
ence between the groups was found, however, in the percentage of 
children for whom positive changes were reported: for the two ques- 
tionnaires, the mean per cent of Ss for whom a positive change was 
reported was ninety-six per cent for the D-A group and seventy-six 
per cent for the S-A group. Negative changes were negligible, being 
cited in both groups for both forms in less than ten per cent of Ss. 
On both forms the main positive changes reported were: (2) 
widened intellectual development and maturity; (b) greater con- 
fidence and social independence; (c) greater sociability with peers. 
The one marked difference found between the groups occurred on 
the second questionnaire, at which time a considerably greater 
number of D-A than S.A parents reported that their children had 
improved in the sphere of social interaction. The improvements 
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cited by the S-A parents tended to group themselves under the 
area of greater intellectual maturity, which included the child’s 
greater ability to amuse himself at home. 

Data from the weekly diaries revealed that both groups had 
nearly identical activity experiences at the times they were at home. 
The vast majority of social contacts which the children had were 
with their immediate family, of which siblings were the most fre- 
quent contacts. The types of activities undertaken were playing 
(as an undifferentiated activity, i.e., the mother giving no account 
as to the nature of the play), going shopping with mother or similar 
outings, reading or being read to, and daytime sleep. Creative activ- 
ities were minimal in both groups, and even “reading or being read 
to,” which was relatively high in the activity list, was cited as oc- 
curring, on the average, only three times during a child's week. 


DISCUSSION 


'The results indicate that the only area in which a consistent 
difference was discernible between the two groups was in social 
interaction. Both the sociometric data and the parents’ reports 
indieated that the D-A group had increased relatively more than 
the S-A group in positive peer interaction. That this difference. m 
not complete, however, was seen in the fact that the porem 
observations did not substantiate this finding. In fact, the observ ot 
tions showed that the groups were ee yy ^ 
associative and coóperative play (terms taken c d "pin e 
which denote a higher level of social play than is ex n Aes : 

d that the S-A group had more parallel play and showe 
play), = erative acts in relation to aggressive acts than did the 
DA, 2 at the end of the study. Both of these results would indi- 

-A gro p - social development of the S-A group, but they cannot 
cate superior liably established since neither of these observational 
be taken as ii ri ud both terminal months. Since the behavior ob- 
results occut t substantiate the trend suggested by the'sociomet- 
servations do ee arents’ reports, and since the observations tend to 
ric data d 2 - d, only a limited conclusion can be made about 
dispute bei ini the groups. At best it can be concluded that 
differences D. A group was not observed to show more social inter- 
though the e S-A group, when Ss in the D-A group did interact, 
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have been this wider social scope that carried over to the home 
setting and prompted more parents of the D-A group than the 
S-A group to note positive social changes in their children. 
Except for this difference between them, the two groups devel- 
oped in fairly similar lines. In part this confirms the original sug- 
gestions made by Gesell, and in part it also contradicts them. Un- 
like the parents of the Yale studies, the parents of the present study 
overwhelmingly preferred daily attendance. The weekly diary 
also showed that during an average week the children on spaced 
attendance do not go on more outings than the children on daily 
attendance. Both these findings, which contradict the findings of 
the Yale group, undoubtedly are a result of the lower socioeco- 
nomic level of the parents in the present investigation. Another 
finding that contradicts the suggestion made by Gesell is that the 
children, including the three-year-olds, do not suffer from greater 
fatigue or ill health by daily attendance. In addition, the lack of a 
clear-cut superiority of one group over the other points out that 
assimilation of newly learned material by use of alternate days at 
home is not necessary. On the other hand, the present study does 
confirm the idea that continued attendance is also not essential for 
healthy progress at kindergarten. 

The findings from the present study cannot be completely com- 
pared to those of the Yale studies, however, since the present groups 
were not made up of children of the same chronological age, as were 
the groups used in the Yale studies. Groups formed on the basis of 
a similarity in age may yield somewhat different results from those 
obtained with a mixed-age group as used in the present study. It is 
therefore possible that the original Yale findings will be borne out, 
i.e., that three-year-olds, in a group made up of their own age only, 
would profit best from kindergarten on a spaced attendance basis; 
while five-year-olds, in a group of their own age only, would profit 
best from daily attendance. One further qualification to the results 
may also reside in the fact that the majority of the present 8s had 
attended kindergarten previously. Although there is nothing 1n the 
present study to suggest it, it is nevertheless conceivable that wit 
a group of children who have never been to kindergarten different 
results might have been obtained. 

In view of the present results, however, it does appear that the 
type of attendance program, in terms of alternate or daily morning 
attendance, does not appreciably affect the children’s general de- 
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velopment. Thus, the type of attendance program put into opera- 
tion rests on other factors, such as the needs of the community and 
of the individual child, as well as the parents’ preference for one or 
the other type of program. 
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Davi» ABRAHAMSEN. Who Are The Guilty? A Study of Education 
and Crime. New York, 1952, pp. 340 + XIV. 

Crime increase in the United States is alarming. Between 1940 
and 1950 every crime except robbery increased out of porportion 
to increase in population. The crime rate in the United States is 
higher than in Italy, which has the highest in Europe, and more 
than ten times as high as Scandinavia. What are the reasons for 
so much crime in the wealthiest country on earth? Why has it 
taken so many diversified forms, ranging from the singular crime 
such as robbery, rape, kleptomania or murder to organized form 
such as gambling, fire setting, insurance fraud, betting, kidnapping, 
climaxing in gangsterism and racketeering? The social, economic 
and emotional significance of crime warrants the attention of 
everybody, not only professional criminologists, says Dr. Abraham- 
sen in the question that he raises, namely, Who Are The Guilty? 
Is it society, home, school, the delinquent or the criminal himself? 
Everybody is involved. Not only the criminal himself. Not only 
the delinquent himself. Involved also are the seemingly innocent— 
politicians, college boys, policemen, even high officials. The statis- 
tics of crime are appalling enough to compel us to ask the question 
about the nature of our society. Is America really a country full 
of power without purpose, as it has been accused of being? And, 
is it our society that is guilty? Generally speaking, that’s not Dr. 
Abrahamsen’s answer—that’s only a partial answer. But, so also 
is blaming the delinquent himself as if he were self-made—a par- 
tial answer. Who are the guilty—how are delinquents made—how 
can they be unmade—that is, how can they be understood and 
how best treated by an enlightened society—are the questions 
Dr. Abrahamsen considers in this interestingly written volume 
called Who Are The Guilty? 

In twelve chapters, Dr. Abrahamsen considers 
the problem and treatment for it. Eight chapters ar 
consideration of the nature and extent of the problem. These 
clude, besides general statistics, a consideration of early signs 9 
juvenile delinquency; psychology in the criminal; psychosomatic 
disorders in antisocial behavior; crime and mental illness; delin- 
quent types; sex and murder. The last four chapters are devote 
to what can be considered treatment and re-education, and these 
are considered in chapters called Prisons and Imprisonment; Treat- 
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ment and Rehabilitation; Mental Health and Schools; Education 
is Prevention. His attitude is expressed by him in the quote from 
a previous publication of his, which reads as follows: “. . . in treat- 
ing the offender, one must keep in mind that the principle upon 
which the offender can be changed and educated is based upon his 
social experience and his reaction to it. And it is this social ex- 
perience that is the basic stimulus for the individual’s re-educa- 
tion.” The appendix includes a glossary of prison jargon, a bibli- 
ography, as well as an index. 

In the preface, Dr. Abrahamsen expresses the hope that this 
book of his becomes a primer about education in crime. It is his 
purpose “to make the reader see and hear so he can feel what is 
happening in this vast but little understood area of human be- 
havior." 'To a fair extent he succeeds in doing so in this volume. 
He is more successful in selecting and presenting the kind of con- 
tent that can yield knowledge for understanding and use that will 
enlighten the lay people interested in the problem. It will not, 
however, satisfy, for example, the people who are interested in 
What is the known and verified knowledge, what are the results of 
present research, and the strength and weaknesses of methodology 
used in research in this area. Too much of the material is presented 
in a highly personalized manner. Too frequent use is made of 
quotes expressed by him in his own previous publications. Gen- 
erally speaking, however, the point of view is in line with modern 
attitudes of informed workers; familiarity with the literature is 
manifested by the author in spite of his more frequent reference to 
his own writings, and only casual reference made tojatudies made 
by other investigators. In his reaction to W. H. Sheldon's book on 
Varieties of Delinquent Youth, he is, however, more critical. What 
he says of it is that Sheldon stresses the constitutional factors 
and gives scant attention to the offender's inter-personal relation- 
ship. The facts and attitudes that he does call to the attention of 
the reader should serve to arouse an interest in a topic that is of 


importance and that does need organized and continued atten- 


tion. H. MELTZER 


Psychological Service Center, St. Louis 


Onrs Ross Yost. What You Should Know About Mental Illness. 


New York: Exposition Press. 1953. pp. 165. 
Not everything is known about mental health or about the 
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treatment of mental disease, described here as “the malady of our 
times.” But much of what is known is not applied by the people 
who could profit from it. The purpose of this small volume by 
Dr. Yost, the medical director of a sanitarium in South Carolina, 
is to present for the lay reader facts about mental illness, ways of 
preventing it, treating it, curing it. Behind the presentation is an 
attempt to give a more hopeful picture of the possibilities for 
treatment of mental illness. It is intended primarily for the in- 
dividual and for his family. The material is presented in three 
parts. Part One, called The Mind in the Making, includes short 
chapters on The Róle of The Mother, The Rôle of the Teacher, 
and The Róle of the Community. Part Two, called Mental Illness 
and Its Treatment, describes The Sick Mind, Psychosomatic Ill- 
nesses, Techniques of Treatment, and chapters on Alcoholism, 
Drug Addiction, Epilepsy and Suicide. Part Three, called Mental 
Illness: A Public Responsibility, includes short chapters on The 
Care of the Mentally Ill, Mental Health: The Goal, Steps Toward 
Mental Health: The Community's Responsibility, and Psychiatry 
and Religion (with an introductory note by Norman Vincent 
Peale). The book contains a Foreword by Dr. Nolan D. C. Lewis, 
and an Introduction by Dr. Leland E. Hinsie, as well as the note 
by Rev. Peale. 

The book is not intended to be a critical presentation of the 
known facts. It is, instead, presented in a manner that aims to 
give the reader “a feeling for psychiatry,” to use Hinsie’s words 
in his introduction. It is this feeling, Hinsie thinks, the author 
puts into his book that makes it attractive to the general publie. 
It is also this feature that may receive more critical and ques- 
tioning attitudes on the part of the more scientific-minded. Dr. 
Peale's introductory note to the last chapter on Psychiatry an 
Religion is perhaps better preaching than it is modern menta 
hygiene, when he says that “the false gods of comfort and security 
to which man has given his energy and loyalty must give way to 
a true God, the Creator and Almighty Father, under whose leader- 
ship and protection a true brotherhood may be created.” This is 8 
frame of reference which should make it possible for some psychia- 
trists to get along well with religio-therapists, and would probably 
be welcome to people who have members of their family in Dr- 
Yost’s institution or similar institutions. If hope and faith based 
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on popularized interpretation of knowledge is the goal of the book, 
then, to a fair measure, it serves its purpose. H. MELTZER 
Psychological Service Center, St. Louis, Mo. 


ADOLF NICHTENHAUSER, Marre L. Coteman AND Davi S. RUHE. 
Films in Psychiatry, Psychology and Mental Health. New York: 
Health Education Council. 1953, pp. 269. 

This is an excellent reference volume for the increasingly large 
number of people who are in position to use films at professional 
meetings or meetings for popularizing their area of knowledge. 
'The body of the book consists of 51 critical film reviews designed 
to help the film-user. The intent is to make it unnecessary for 
people who want to select films to preview them or go to the ex- 
pense of spending time and money for previewing the films, before 
deciding concerning the suitability for intended use. Most of the 
reviews in the volume were prepared by Marie L. Coleman. Where 
this has not been the case, it is indicated and credit given. Of the 
51 films reviewed, only 2 are intended for psychiatric training. 
These are Mental Symptoms series and Psychotherapeutic Inter- 
view series. The variety of topics included is wide, but most of 
them are in the field of therapy, developmental psychology or 
etiology of neurosis. The content is presented in three parts: Part 
I, the smallest, includes a gallery of scenes from the films con- 
sidered, a foreword, a preface and a listing of acknowledgements. 
The real content begins with Part IT, which contains à discussion 
of the film reviewing techniques and suggestions for the utilization 
of films in psychiatry, psychology and mental health. Included 
here are chapters on how to use the book, how the content and 
presentation influence each other in psychiatrie, psychological and 
mental health films, the use of — Le qt for the bl og 
of i and a-half-century of motion pictures in neurology, 
oe, ay and mental health education. Part III is 
composed. of reviews of representative films in these three areas, 
and some of the topics included are: Activity for Schizophrenia, 
Angry Boy, Children’s Emotions, Eight Infants, Family Circles, 
Feelings of Depression, Feeling of Hostility, Feeling of Rejection, 
Frustration Play Techniques, Overdependency, Problem Children. 
Part IV of the book is composed of indices to films and contains 
also an index to the content of the film subject matter of 101 films 
in these three fields and a suggested audience guide to the 51 films. 
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The appraisal in the three sections is composed of three portions: 
(1) the content section, which analyzes the purpose and objectives 
of the film and examines the treatment of the subject; (2) the 
presentation section, which discusses the film style used, a record 
of actual cases, reénactment dramatization and the skill with 
which the various factors of motion picture presentation have been 
applied; and, (3) the effectiveness section, which reaches conclu- 
sions from the analysis of content and presentation and the rest 
of the effects of the film. In general, it attempts to answer the 
questions: what can be learned from the film?, who can learn from 
it? and, how well can one learn from it? 

Allin all, there is a good deal of knowledge for use here which 
can help improve decisions concerning the use of films for purposes 


needed in teaching or popularized presentation of the many topics 


included. H. MELTZER 


Psychological Service Center, St. Louis, Mo. 


Health and Human Relations. (A report on the third of a series of 
international conferences on Health and Human Relations, 
held in Hiddesen near Detmold in Germany, August, 1951, 
sponsored jointly by the Josiah Macy, Jr., Foundation and 
the World Federation for Mental Health.) New York: The 
Blakiston Company, Inc. 1953. pp- 192. 

Health and Human Relations is a report o 
in a series of international conferences, sponsored by the Josiah 

Macy, Jr., Foundation and the World Federation for Mental 

Health. The three co-chairmen were: Dr. Frank Fremont-Smith, 

Medical Director of the Macy Foundation; Dr. John R. Rees, 

Director, World Federation for Mental Health (who chairmanned 

the previous two conferences); and Dr. Werner Villinger, Director; 

Neuropsychiatric Clinic, University of Marburg. The editors of the 

English report have tried to present results of the conference in & 

manner that would be a projection of the experience and reflect 

the spirit of the meeting, and show both the organization and om 

dynamics of its development. Generally speaking, the editors di 

an excellent job in giving this kind of report. The material 18 

presented in three parts. Part I deals mostly with the reports © 

the working groups, composed of groups on Mental Hygiene 2n 

Education, Mental Health in Modern Society, Sources of Soci 


Tensions in Postwar Germany, and Promotion of & Mental Hygiene 


f a conference, third 
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Movement in Germany, with a list of recommendations for con- 
ference action. Part II is composed of informal presentations by 
select speakers, which include a discussion on Health and Human 
Relations, the Problem of Neurosis, The Sense of Inner Identity. 
Part III includes the comments by four of the participants. The 
volume also includes an appendix—the Conference Who’s Who. 

In the reports, one is informed that the purpose of the Con- 
ference is to do more than merely discuss, that many of the partici- 
pants were actually doing things that have real significance for 
world health. Demonstrations of such influences are cited. Making 
decisions about the invitation list for the Conference seemed to be 
a problem for the people who made the preparations for it, but 
the list seems to be fairly representative of psychiatric and related 
auspices and people in the fields in Germany at the present time— 
a total of thirty-six participants. 

The goal of this, as well as the previous two conferences, is 
certainly an admirable one. It is devoted to “the aim of improving 
health in the broadest sense of the word as a basis for truly human 
and dignified interpersonal relationships.” Illustrations of the 
material considered by some of the working groups can be given 
by some of the questions considered by, for example, the first 
group on Mental Hygiene and Education. The questions considered 
included the meaning of health, the meaning of authority in 
human relationship, the meaning of freedom, the goals of education, 
the réle of fear in educative process, ways of integrating religion 
and education, qualities of leadership needed in the world today. 
Some of the questions considered by the second group working 
on Mental Health in Modern Society were: how we can help in- 
dividuals to become more independent in their thinking and thus 
be less influenced by mass movements; what ways of the mass 
media of communication, radio, film and press, affect human re- 
lationships; how we can plan for further research on the problems 
of mass behavior; and how we can encourage codperative staff 
planning and thinking within all kinds of organizations; and, what 
are the qualities of leadership most needed in the world today and 
how can such leadership be developed. People interested in the 
World Federation of Mental Health doings will find this an in- 
teresting volume, including the large appendix describing in some 
detail the Conference Who's Who. — H. MELTZER 

Psychological Service Center, St. Louis, Mo. 
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Kennetu W. Hoposox. The Deaf and Their Problems. New York: 
Philosophical Library, 1954, $6.00, pp. 364 plus xvii. 


In “The Deaf and Their Problems” the English author, Kenneth 
W. Hodgson, gives an excellent historical account that will be of 
interest to all people in the field, no matter where they live. Though 
the reference point for consideration of their problems is England, 
the knowledge considered and the history of the interest in fulfill- 
ing the social responsibility of educating the deaf is of universal 
nature, In this book one can find an excellent historical account 
of the kind and the amount of attention that was given to the edu- 
cation of the deaf from the sixteenth century onward. It also in- 
cludes a number of chapters on the mechanism of hearing and the 
assessing of damage to the ear. Brought to a head also is a con- 
sideration of what has been a dilemma in methods of educating 
the deaf—the question of oralism (teaching by speech and lip- 
reading) versus silent methods (by signs and finger-spelling). Here 
also one can get a description of all of the important contributors to 
the field over the ages. Interesting by-produets, also included, are 
the descriptions of Laura Bridgman, Helen Keller, the contribu- 
tion of Alexander Graham Bell, as well as the contribution of Gal- 
laudet. 

In the preface, written by R. A. S. Paget, which includes an ex^ 
cellent review of the intentions of the volume, the question of defi- 
nition of goals is raised—is it to educate the deaf so that they shall 
appear to be normal, or is it to educate their minds so that they 
may, as soon as possible, come out into the hearing world as its 
intellectual equals? That is a question for Mr. Paget. The author, 
himself, seems to have no dilemma of this nature and conceives O 
education for the deaf as having an unreserved goal, namely the 
achievement of the reality of full citizenship. Accordingly, the 
author finds the present stage of development merely the begin- 
ning of an international consideration of the problem. 

He ends the book with the calling of our attention to the need 
for educating colored as well as white and this he does in the fol- 
lowing words: "There are now nearly 1000 deaf schools in the 
world. But four out of every five of them are for white children. 
Tens of thousands of colored children are still in that uncouth 
misery to which all deaf children were once condemned. For them 
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our four centuries of progress is something which has yet to 
happen.” 
H. MELTZER 
Human Relations Research Foundation, St. Louis, Missouri 


Percivar S. Symonps. Dynamics of Psychotherapy: The Psychol- 
ogy of Personality Change. Volume I. New York: Grune & 
Stratton, 1956, pp. 201. 


This is the first of a three volume edition. The subtitle for this 
volume is Principles, and according to the preface, “outlines the 
general psychodynamic principles on which contemporary psycho- 
therapy is based." Discussion is limited to principles and does not 
attempt to deal with the problems of who should practice therapy 
and of professional relationships. The point of view is "essentially 
eclectic;” the purpose is “to integrate these diverse points of view 
into a consistent and unified whole." 

In sixteen chapters the book deals with the definition, need and 
aims of psychotherapy; the process of psychotherapy; indications 
for counseling; indications for psychotherapy; contraindications 
for psychotherapy, the nature and definition of psychoneuroses; 
the dynamies of psychoneuroses, (two chapters) ; neurotic trends 
and symptoms; predisposing and precipitating factors; goals of 
psychotherapy (three chapters) ; spontaneous recovery (two 
chapters), and some theoretical problems including determinism 
and free will, psychotherapy and learning, values, culture and 
education. . r id 

Psychotherapy seems to have many aims: improved ability to 
function in work and family; ability to drop one's defenses and to 
face problems; inner peace and freedom from anxiety, worry and 
stress; self-sufficiency and self-responsibility; a help to childhood 
development and greater independence; honesty with self; "inte- 
grating the various parts of the mind into a smoothly running 
Whole, so that the individual will have all of his forces under his 
active control;" relief from suffering; achievement of happiness; 
conscious recognition of unconscious goals; good adjustment; ma- 
turity ; release from and reduction of tension; personality integra- 
tion; improvement and strengthening of the superego; new super- 
ego standards; better insight and proper self-evaluation; a new 
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philosophy of life; greater adequacy and competence; increased 
capacity for social intercourse; and even the “attainment of free 
will.” 

The author states that “Therapy fulfills its task when it helps 
the individual to accept himself more fully, to gain insight with 
regard to himself, and to find freedom in self-expression. ... The 
direction that his expression will take must be determined by the 
forces of society.” “Spontaneous recovery” is to be considered as 
well as recovery due to psychotherapy. 

In this excellent study the reader may find some difficulties; 
there seem to be too many dogmatic assertions but the author ex- 
cuses them because of Jack of experimental data. On page twelve 
the thesis of the book is “that psychotherapy consists of introduc- 
ing into the life of the individual some new influence which will 
counteract the influence of previous experiences which have given 
rise to his neurotic difficulties.” On page nineteen the thesis is 
“that abreaction is the principle dynamic factor in therapy.” It 
is a question as to the exact relation of the terms “psychoneurosis” 
and “neurosis,” when they are treated separately and somewhat 
differently in the index. i 

The attempt to integrate the various points of view existing in 
connection with psychotherapy today may leave the reader with 
some doubt as to whether they can all be integrated, but he may 
have a high respect for the valiant effort. Psychoanalysis may 
appear to loom too large in this volume, but the author clearly 
states that it is not all of psychotherapy. 

This is a valuable contribution to the study of psychotherapy- 
A well made index (pp. 201-211) concludes the volume. 
A. S. EDWARDS 


The University of Georgia 


ALBERT J. Harris. How To Increase Reading Ability, Third T 
tion. New York: Longmans, Green and Co., Inc., 1050; pues 
pp. 633 plus xix. 


This text deserves wide use. It does little to reduce the sediment 
of speculation about the controversial subject of reading instruc- 
tion, but neither does it muddy the pool further. The author's pri- 
mary aim is to provide "scope, balance, and practicality” anc; 
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secondarily, “enough theory to provide a basis for judging when 
and why a particular technique may be useful.” In addition, he 
promises “a new theory on the significance of hand and eye domi- 
nance in relation to reading disability.” 

He has achieved the primary aim. Topics discussed include the 
bulk of relevant skills, techniques and controversies, couched in 
simple language. Teachers will be pleased to find many useful 
techniques not to be found elsewhere. The author has a lucid style 
and a knack for approaching problem areas historically, thus pro- 
viding for the student a perspective not easily achieved otherwise. 

The secondary aim, that of providing theory where needed, is 
achieved less perfectly, at least in those instances where theory 
is treated explicitly, i.e., where an attempt is made to apply cur- 
rent theoretical constructs. The reader may be surprised to find, 
for instance, that “As it relates to word recognition, insight means 
attaining a clear and accurate perception of the printed word” (p. 
319). On the other hand, when the author takes positions on con- 
troversial questions, his implicit theoretical assumptions appear 
to be both accurate and consistent( pp. 22, 28, 36, 80). One serious 
exception is that concerning the important problem of the compo- 
nents of reading. The weight of evidence from factor analyses of 
reading comprehension points to “words” and “words-in-sentences” 
as the only clear components. Dismissing the evidence because of 
apparent disagreement of “statistical experts,” somewhat more 
justified in 1946 than in 1956, the author devotes a chapter to eight 
“special comprehension skills.” It is questionable whether such 
special skills exist qua skills. They appear to be conceptual activi- 
ties most easily influenced by attention to “words” and “words-in- 
sentences.” To continue sending teachers down what appears to be 
a blind alley may be a serious error. 3 . . 

The author's new theory of hand and eye dominance in reading 
disability, briefly stated, consists of à common denominator for 
reversal symptoms, viz., directional confusion caused, perhaps, by 
neurological defect or slow maturation, conversion from left to 
right handedness, or emotional tension. The theory has consider- 
able merit and is congruent with recent evidence from the re- 
viewer’ tory. 

Dd ars tn should be given the appendices which in- 
clude an annotated list of over one hundred published tests, ad- 
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dresses of publishers and a graded book list of more than one thou- 
sand one hundred titles. This book should prove valuable both as 
a text and as a reference manual. 
Dowarp E. P. SMITH 
University of Michigan 


Win A. Hunt. The Clinical Phychologist. Springfield: Charles 
C Thomas, 1956, pp. 205. 


This book presents the Thomas William Salmon lectures for 
1954. The committee made an excellent selection for lecturer, since 
Dr. Hunt has not only lived through the period of the development 
of clinical psychology, but has been a member of committees which 
have had to deal with the difficult and intimate problems of rela- 
tionships with other professions. It was therefore possible for him 
to bring to light pertinent facts that have not been generally known. 

The purpose of the lectures was to bring about a better under- 
standing of the problems and relationships, especially between 
psychology and medicine. The three lectures are presented under 
the following titles: “What Is a Clinical Psychologist?” “How the 
Clinical Psychologist Came to Be,” and “The Problems and Fu- 
ture of Clinical Psychology.” 

The treatment is essentially historical. It begins with general 
psychology as the background, and proceeds to discuss the gradual 
development of clinical psychology upon the scientific foundations 
already achieved. It goes on to discuss the increasing demands an 
responsibilities laid upon psychologists, with special reference to 
the efforts to work out the proper relationships between medicine 
and psychiatry on the one hand, and psychology on the other. A 
special interest of the author is the desire to contribute to getting 
beyond the stage of definition and argument and back to the great 
and much needed task of codperative work in promoting menta 
health. . 

As we have found historically in other fields, and as appears m 
connection with the problems involved in our own profession, it has 
been found to be more difficult to work out satisfactory definitions 
and limitations of duties and responsibilities, than it is to cooper 
ate, in actual practice, with those who are willing to coüperate. 
Perhaps the satisfactory definitions will be possible only pri 
continued evolution and development in the work of those who 
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continue to practice in proper consideration with and considera- 
tion for one another. Progress in the right direction is indicated in 
the text. 

Among the many problems discussed are diagnosis, the use of 
tests, the use of professional judgment, research, kinds of psychol- 
ogy, the nature and methods of therapy and psychotherapy, certi- 
fication and licensing, psychology’s claim to the three functions of 
diagnosis, research and therapy, the clinical psychologist’s work 
as privately encouraged and publicly denounced, the question of 
who is “ill,” the future of clinical psychology, etc. 

This is probably the best brief, yet reasonably comprehensive 
statement of the work, problems and progress of clinical psychol- 
ogy at its present stage. It is written clearly, effectively, and with 
a delightful sense of humor. It is also written with good judgment 
in a situation in which the latter is certainly needed. The few 
minor adverse criticisms might be made to seem out of place in 
view of the general excellence of the entire volume. It is highly 
recommended to anyone who desires to learn accurately about 
clinical psychology. 

The book concludes with sixty references. 

A. S. EDWARDS 


The University of Georgia 


James L. Hymes, Jr. A Child Development Point of View. New 
York: Prentice-Hall, Inc., 1955, pp. 145 plus ix. 


The promotion of health and the prevention of mental illness 
is what the author of this volume, Mr. James L. Hymes, Jr., has in 
mind as a goal in writing the small volume on the child develop- 
ment point of view. The material is presented in popularized man- 
ner, both in the selection of captions and in the write-up. The 
captions, themselves, pretty well describe what the author has in 
mind, as well as his level of presentation in the light of it. “You 
are a Teacher” is followed by “Your Youngsters Must Like Them- 
selves” and “Your Youngsters Can Climb to Highest Peaks.” The 
purpose of the volume is relatively well indicated in his last para- 
graph of this small book: “Once you truly regard the child as a 
human—caring, seeking, preferring, needing—you have to work: 
You have to think, to use your judgment and your sensitivity. But 


the motor is running, and the course is charted. How far will the 


* 
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human go? How fast? For how long? With your good treatment, 
new records are possible: of reason and decency, of kindliness, of 
caring. The task is well worth the effort of your hands, the time 
of your head, and the searching of your heart.” 

This little volume is not intended as a text in educational psy- 
chology. It can serve excellently as a public relations manual for 
the inspiration of school teachers. 

H. MELTZER 

Human Relations Research Foundation, St. Louis, Missouri 


CHARLOTTE TowLe. The Learner in Education for the Professions. 
Chicago: The University of Chicago Press, $7.50, pp. 432. 


The Learner in Education for the Professions as Seen in Educa- 
tion for Social Work—the full title of this volume—precisely de- 
scribes the nature of the volume, in content as well as in interpre- 
tation. Charlotte Towle is an experienced social worker, who has 
played an active rôle and has been a part of the growth of psy- 
chiatric social work and talks from direct observation and experi- 
ence. What she says is of particular importance to social workers, 
but much of what she says is also of significance to professional 
workers interested in education in their own areas. 

The material is presented under two parts. Part I is called 
“Learning Process and Educational Process” and considers objec- 
tives of professional education, basic learning principles and also 
personality development and the integration of learning. This part 
also considers administrative dimensions such as selection of stu- 
dents and the social component in education for professions. ? 

In Part II, called “Applications in Teaching Social Case Work 
are included a consideration of the place of social case work, areas 
of learning in case work, selection and arrangement of case mate- 
rial, faculty collaboration and the content of class sessions, assign- 
ments and examinations. í 

The general attitude of the author is given in the early part 0 
the first chapter: “Professions today are intent upon their contri- 
bution to the development of the individual rather than with serv- 
ices that merely assure survival. It is fast becoming the common 
sense that, to serve mankind for its development, one must under- 
stand man... Responsible, intelligent, professional help must In- 


creasingly be oriented to the individual in the context of his s0012 
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position, with perception of what the service is doing to the person 
while doing something for him.” " 

The book should be of interest to all people concerned with pro- 
fessional education, particularly in the area of the social sciences. 
It includes a bibliography, as well as an index. 

H. MELTZER 

Human Relations Research Foundation, St. Louis, Missouri 


GzoncENE Seward. Psychotherapy and Culture Conflict. New 
York: The Ronald Press Company, 1956, pp. 299. 


The author speaks of this volume as a collaboration: he supplied 
the textual material and Dr. Judd Marmor supplied the four case 
studies from his own practice. The purpose of the book “‘is to de- 
lineate various cultural aspects of psychotherapy that have 
hitherto been slighted. .. . The major portion of the book is devoted 
to an exposition of the psychodynamics peculiar to members of a. 
variety of subcultures in the United States today.” 

Taking “classical psychoanalysis” as the point of departure, the 
author examines neo-Freudian systems, and attempts to “point 
the way a step further, across the cultural gaps separating various 
minority groups from the mainstream of the culture.” 

Eleven chapters take up in the following order, A Preface to 
Psychotherapy, Old World Accents, The New Look, Social Class 
and Personality, Treatment and Status, Negro Portraits, Color 
and Conflict, Women as a “Minority,” New World Symphony and 
Discord, America’s First Families, and Jews Between Two Worlds. 

It appears that the study of subcultures is now one of the most 
important problems on the frontiers of psychological investigation 
and that psychotherapy is in need of therapists well acquainted at 
least with the subculture involved in any given case. For any treat- 
ment it is believed that a proper understanding of cultural effects 
is n 

The author does not accept psycho-analytical hypotheses un- 
critically and warns against interpreting in terms of wrong theories. 
Emphasis is placed upon concrete examples; definite contribution 
to the understanding of the general discussions is made by the case 
studies of Dr. Marmor. ! 

n iess studies include (1) Flight from Poverty, dealing 
With a thirty-three year old attorney; (2) Color and the Blues, 
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a study of a twenty-seven year old Negro saxophone player; (3) 
A Female Woman Hater, a married woman in her early thirties; 
and (4) A Jewish “Fascist,” bringing out factors involved the fas- 
cist trends of a young American college student. 

The main discussions also include more or less detail relating 
to the cultural problems of the Hopi, Navajo and Sioux Indians; 
Greeks; Japanese; Italians; Chinese; Issei and Nisei; and a few 
others who are included in the very complex matrix of cultures in 
The United States. Space is especially accorded to problems of 
anxiety, authority, childhood and child training, dependency, eth- 
nic identification, family problems, hostility, learning and educa- 
tion, sex problems, social class and mobility, values, and therapy. 

Whether or not the reader is inclined towards psychoanalytic 
interpretations, he will find in this book valuable suggestions for 
better insight into problems of personality with special reference 
to the influence of cultures and the conflicts between cultures. Ex- 
amples given indicate how completely psychotherapy can fail 
when the facts of cultural conflicts are not known and understood. 

Each chapter contains a list of pertinent references. There is an 
index of names (pp. 285-289) and an index of subjects (pp- 


291-299). 
A. S. Epwarps 


The University of Georgia 


THE JOURNAL OF 
EDUCATIONAL PSYCHOLOGY 


Volume 48 October, 1957 Number 6 
————————————————————Oo 
THE LEARNING OF MOTOR SKILLS AS 
INFLUENCED BY KNOWLEDGE OF 
MECHANICAL PRINCIPLES 


Frances M. COLVILLE 


Santa Barbara College, University of California 


Many general principles of mechanies are relevant to the teach- 
ing, learning, and performance of activities included in the physi- 
cal education curriculum. These principles may describe the mo- 
tion of objects such as balls and racquets, movement of the body 


itself, or a combination of both. 
The question has been raised as to whether knowledge of these 
tanding of their application to 


general principles and an unders 
these activities will facilitate the learning and improve the 
performance of pupils in physical education classes. Also of interest 
are questions concerning the influence of knowledge of these 
principles as applied to one activity upon subsequent learning of 
other activities to which the same principles apply. Results of a 
few experiments suggest that a pupil who understands a principle 
related to one skill may master à related skill more readily than 
the pupil whose experience has been restricted to specific instruc- 
tion in technique without explanation of pertinent principles. 

With reference to the teaching of a motor skill in which a specific 
principle of mechanics is involved, then, two questions have been 
raised: 

1. What is the effect of knowledge of a principle upon immediate 

principle applies? 


learning of a skill to which the Sr 
2. What is the effect of knowledge of a principle learned in 


relation to one skill upon subsequent learning of a different, or 
more complicated skill to which the same principle is applicable? 

Reported investigations bearing on the acquisition and transfer 
of principles are limited, and although most authorities agree that 
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general principles are probably transferable, little strong experi- 
mental evidence is available to support such a claim. Apparently, 
little or no experimentation has been done in the field of physical 
education on the effect of knowledge of principles of mechanics in 
learning situations involving body movement and large muscle 
activities. 

Judd (7), and Hendrickson and Schroeder (4) reported that 
knowledge of the principle of refraction was beneficial in dart 
throwing and rifle shooting at a target under water. Cox (2) found 
that knowledge of principles involved in manipulating electrical 
equipment facilitated the assembling and stripping of such equip- 
ment. Other experimenters have reported similar results from 
investigations involving mathematies, mental games, spelling, 
card tricks, and mechanical puzzles (13, 11, 15, 10, 14, 8, 2). In 
contrast to these findings are a number of investigations in which 
the method of including instruction in the principles involved did 
not appear to be more effective. In studying the effect of drill 
versus the learning of generalizations related to addition and 
subtraction, Olander (9) found that the two methods appeared to 
be equally effective. Babitz and Keys (1) found similar results 
with college chemistry classes, and Hendrix (6) reported only one 
difference, significant at the 12% level, in her experiment with 
teaching algebraic generalizations. n 

With the exception of the experiments by Judd, Hendrickson 
and Schroeder, and Cox, all of the reported investigations have 
dealt with various types of learning other than motor learning, 
and only Judd’s experiment involved skills requiring the coordi- 
nated use of large muscles of the body as well as fine coordinations 
of the fingers. Judd, and Hendrickson and Schroeder, while report- 
ing a difference in favor of instruction in principles, included this 
instruction in such a way that additional time was allotted to the 
group receiving the instruction. Thus, the experimental groups 
had the advantage of additional time as well as additional infor- 
mation. Furthermore, Judd supplied no statistical analysis of his 
early experiment, and the other differences reported were signifi- 
cant at a statistical level below that which is usually considere 
acceptable. 1 

In the present investigation, an attempt was made to contro 
these ambiguities by using an experimental design which insure 
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that both the experimental and the control groups spent equal 
amounts of time in learning and performing and by considering 
only those differences which were significant at the 5% level of 
confidence or above. 


Method 


Purpose of the Study 
It was the purpose of this study to investigate certain questions 
ducation activities in which 


related to the teaching of physical e 

specific principles of mechanics are involved. This problem was 
approached by: (a) selecting three principles of mechanics which 
are pertinent to motor skills; (b) selecting three motor skills, each 
of which utilizes one of the principles; (c) establishing for each 


skill two comparable groups of Ss, one of which was taught with- 


out reference to the principle involved, and the other of which 
(d) comparing 


was taught to understand and apply the principle; i 
for each skill the performance of the two groups, one of which 


spent the entire time practicing the skill without reference to the 
f which spent part of the time 


principle involved, and the other o he t 
practicing the skill and part of the time in learning the principle; 
(e) comparing for each skill the performance of the two groups 1n 
a similar or more complicated form of the skill to which the same 


general principle applies. 


Selected Principles and Skills 

The selected principles and the skills in whic 
were as follows: . 

Principle I: The angle of incidence is approximately equal to 
the angle of reflection. Skill: Rolling à ball against a surface, or 
Surfaces, from which it would rebound. — | 

Principle II: In stopping & moving object, the force opposing 
the momentum must be equal to the force of the momentum, and 
if the object is to be caught, this momentum must be dissipated 
by reducing the resistance of the catching surface. Skill: Catching 
a tennis ball in a lacrosse stick and catching a badminton bird on 


a tennis racquet. 

Principle III: An object se 
by an external force is acted UP 
and by gravital acceleration. S 


h they are utilized 


t in forward motion through the air 
on by the force of the momentum 
kil: Archery. 
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Procedure 


Three parallel experiments were devised, each of which was 
designed to investigate performance of one of the three skills. 
The Ss for all three experiments were undergraduate women 
students at the University of Southern California. There were 36 
Ss in the Ball Rolling experiment, 40 in the Catching experiment, 
and 42 in the Archery experiment. (The Ss in the Archery experi- 
ment included some men whose scores were treated separately 
from those of the women.) For each of the experiments the Ss 
were divided into two groups, one of which spent the entire time 
learning and practicing the skill, and the other of which spent 
part of the time learning about the principle and the rest of the 
time learning and practicing the skill. Each experiment consisted 
of two tests. During the First test the Ss of the nonprinciple 
group learned the skill and practiced. The Ss of the principle 
group learned the principle, and the skill, and practiced. The total 
amount of time spent by both groups was the same. During the 
Second test both groups practiced a similar or more complicated 
skill. Each group spent the same amount of time and no mention 
was made of the principle involved. 

The Ball Rolling experiment. The Ball Rolling experiment was 
designed to illustrate the principle describing the rebound of a 
ball. The appratus consisted of a modified pinball machine plunger, 
a squash ball, a felt-covered table surface four feet wide and six 
feet long, and a target made of colored construction paper. The 
overall size of the target and each of its scoring divisions was 
determined by recording the hits made by a group of comparable 
Ss who were not used in the experiment. These Ss rolled the ball 
so that it would rebound from one side of the table as close 35 
possible to the center of the adjacent side which was marked with 
a square of white paper. The rest of the side was covered with 
brown wrapping paper, and spots on this paper to which the balls 
rolled were marked in red pencil. This scoring sheet provided the 
basis for establishing a six-standard deviation scale, and from this 
scale the scoring areas of the target were derived. 

The Ss were asked during the First test to roll the ba 
one side of the table in such a manner that it would reboun 
target placed against the adjacent side. For the Second test they 
were asked to roll the ball so that it would rebound from tw? 
adjacent sides to the target set at right angles on the third side- 


I] against 
d to the 
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The Catching experiment. The Catching experiment consisted of 
two tests each involving the skill of catching a moving object. The 
First test involved catching a tennis ball in a lacrosse stick. For 
the Second test, they caught a badminton bird on a tennis racquet. 
The apparatus for this experiment, in addition to balls, birds, 
crosse, and racquet, included a homemade device for projecting 
the tennis ball and the badminton bird. This device proved to 
have high reliability. 

The Archery experiment. The Archery experiment was planned 
to illustrate the principle of gravital acceleration and its action 
on arrow flight and the relation of line of flight to line of sight. 
The First test involved shooting from 20 yards. The Second test 
included ends shot from 30 yards, from 40 yards, and a Junior 
Columbia Round. 

Analysis of data. In each experiment, the reliability of the 
scores was established by use of the split-half technique, the odd 
scores being correlated with the even scores. Use of the Spearman- 
Brown Prophecy Formula for predicting reliability of the whole 
test resulted in coefficients which ranged from .81 to .94 and thus 
it was felt that these scores were sufficiently reliable to permit 
further analysis and comparison. . 

All of the data were analyzed by means of analysis of variance 
using Edwards’ (3) technique for repeated measurements of the 
same subjects. This technique tests differences between methods, 
between trials, and between the interaction of trials and methods. 
l ability of the two groups, as evi- 


In each experiment the initia [ 
: was tested and found not to differ 


denced by preliminary trials, 
Significantly. 
Findings 


In the Ball Rolling experiment, no significant difference in the 


performance levels of the two groups was found on either test. A 


difference between trials significant at the 176 level of confidence 


Was found on both tests. J- : 

In the Catching experiment, no significant differences Were 
found in performance levels, and a difference between trials signifi- 
cant at the 5% level of confidence was found only on the First test. 

In the Archery experiment no significant difference in perform- 
ance levels was found. A difference significant at the 1% level for 
men and at the 5% level for girls was found on the Second test at 


all distances except forty yards. 
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The following general findings were noted: 

1. A significant amount of learning took place under both 
methods of instruction. 

2. This learning was not only similar in amount but also in 
pattern. Exceptions occurred in the Second test of the Catching 
experiment where no significant increase in scores occurred in 
either group, and in the Junior Columbia Round for men in the 
Archery experiment, where both groups increased in scoring abil- 
ity, but the group which had been taught the principle was signifi- 
cantly better at twenty yards. 

These findings are in general agreement with those of some 
published investigations and are opposed to those of some others. 
However, most of the reported experiments have dealt with various 
types of learning other than motor learning. The ability to use 
understood principles in performing a motor skill presents a com- 
plex problem since more than one type of learning is involved. AS 
Ragsdale (12) has pointed out, a learner may understand a prin- 
ciple when applied to inanimate objects, but he may not under- 
stand and be able to apply it to his own movements. In general, 
it seems that the findings of this study support Ragsdale’s obser- 
vation, at least in the initial stages of acquiring an unfamiliar 
motor skill. 


Conclusions 


Within the limitations of the three parallel experiments which 
constitute the present investigation, there is no evidence: , 

1. That instruction concerning mechanical principles utilized in 
the performance of a motor skill facilitates the initial learning of 
the skill to any greater extent than an equivalent amount of time 
spent in practicing the skill. 

2. That such knowledge facilitates subsequent learning 85 
evidenced in the performance of a similar or more complicated skill 
to which the same principle is applicable. : 

However, since it appears that some part of the learning period 
may be devoted to instruction concerning general principles with- 
out detriment to the motor learning of the students, it would seem 
desirable to include such instruction in order to provide this addi- 
tional opportunity for acquiring some related knowledge about 
principles of mechanics and the application of forces. 
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STUDENT VS. INSTRUCTOR DESIGN OF 
STUDY METHOD": 


SLATER E. NEWMAN? 


Air Force Personnel and Training Research Center, Colorado 


The assigning of study tasks to be done by the student either 
in or out of class is common educational practice. Such practice, 
however, implies an assumption that each student can best pre- 
scribe for himself how such material should be studied. 

It was the purpose of the present investigation to demonstrate 
that such an assumption is not warranted; that rather, the use of 
study materials and procedures designed on the basis of informa- 
tion provided by research on learning will lead to faster student 
learning than the use by each student of his own set of techniques. 


Procedure 


The task confronting each student was that of learning the 
names for each of 20 electrical symbols. This task was chosen be- 
cause it appeared to be representative of the kind of task on which 
considerable research-based information already exists. The fol- 
lowing schedule was used with each S, all of whom were run individ- 


ually: 


1. A pretest was administered to determine how many of the 20 symbols 
the student could name correctly. The papers of those students who knew 
the names of more than three symbols were not used in the analysis of post- 
test scores. 


1 This report is based on work done under ARDC Project No. 7715, Task 
No. 77283, in support of the research and development program of sus 
Air Force Personnel and Training Research Center, Lackland Air Force 
Base, Texas. Permission is granted for reproduction, translation, publica- 
tion, use, and disposal in whole and in part by or for the United States 
Government. The opinions or conclusions expressed herein are those of the 
author. They are not to be construed as necessarily reflecting the views 07 
endorsement of the Department of the Air Force or of the Air Research anc 
Development Command. 

? A paper, based on some of the findings reported here, was presented ab 
the 1956 meetings of the American Psychological Association. L 

? Now with the Department of Psychology, North Carolina State Co 
lege, Raleigh, North Carolina. 
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2. Instructions dealing with what the student should do during the learn- 


ing period were read aloud by the instructor. 

3. Each student studied the list either for 734 or for 12 minutes. It was 
anticipated that such differences in study time would produce statistically 
significant differences in performance on the posttest and that these differ- 
ences might be used as a rough yardstick in gauging the effects of the kind 


of procedure used during the study session. - 
4. The posttest (which was the same as the pretest) was then adminis- 


tered. All 20 symbols were presented on a sheet of paper, and the student 
was asked to write the name of each symbol he knew to the right of the 
symbol, The posttest was administered immediately after the study period. 


Students were randomly assigned either to a Student Group or 
to an Instructor Group. The procedures used during the study 
period with each group were the following: . 

Student Group (N = 15). At the beginning of the study period, 
the student was given a sheet on which were listed each of the 20 
electrical symbols and its name, and he was asked to study the 
list. Each name was to the right of its symbol. The order in which 
these symbols were arranged on the study sheet differed from that 
in which they appeared on the test sheet. Seven students were 
allowed 734 minutes for study, and eight students were allowed 
12 minutes for study. The instructor was present during the entire 
study period. 

Instructor Group (N = 


used with this group were obtained p 1 
With the results of research on learning the kind of task employed 


here. To each psychologist the symbol-name pairs to be learned 
Were presented, and he was asked to suggest techniques which 
could be expected to facilitate the learning of these pairs. There 
Was not, however, complete agreement among the five psychologists 
as to techniques which should be employed. Thus, the combination 
of techniques used with the Instructor Group was based on a 
Selection by the author from among those suggested. For the final 
Procedure, file cards were prepared on which appeared either the 
Symbol and its name, the symbol alone, the name alone, or the 
name of the set (to be described next) to which a symbol belonged. 
The 20 symbols were divided into four sets of five each. All symbols 
in a set shared one feature in common. For set one, it was a circle; 
for set two, a wavy line; for set three, a black dot; and for set four, 
a straight line. Early in training, study of set one was completed 
before study of set two was begun, etc. Approximately 10 seconds 


15). Suggestions for techniques to be 
from five psychologists familiar 
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intervened between study of each set. These materials were used 
in a five-step procedure as follows: 


Step 1. Each of the five names was presented individually for approxi- 
mately five seconds (e.g., capacitor). The instructor pronounced each name 
and then had the student pronounce the name. 

Step 2. The set name was then presented (e.g., straight lines). For this 
and subsequent steps, a three-second rate of presentation was employed 
for each card. 

Step 3. Each of the five symbol-name cards was then presented (e.g. 
capacitor symbol-capacitor). The student pronounced each name and was 
corrected if he erred. 

Step 4. Each card with the symbol alone was then presented (o.g., capaci- 
tor symbol), and the student responded with the name of the symbol if he 
felt he knew the name. 

Step 5. After these steps had been followed with each of the four sets, 
each of the 20 symbols was presented alone and was followed by a card on 
which appeared the symbol and its name. 


With seven students, this set of procedures was used once (total 
study time for each student approximately 734 minutes). With 
eight additional students, this set of procedures, excluding Step 1 
(i.e., response learning) was then repeated a second time. The total 
study time for this group was approximately 12 minutes. 


Subjects 


Thirty airmen, each of whom had recently completed basic 
training, were randomly assigned to one of the experimental groups- 
All Ss had a score of 5 or above on the Airman Classification 
Battery’s Technical Specialty Aptitude Index (a measure of general 
intellectual ability). 


Results 


The random assignment of students to groups did not result in 
significant between-group differences in aptitude. Nevertheless, 
each student’s score on the posttest was adjusted on the basis of 
his aptitude index. A 2 X 2 analysis of variance (i.e., type of 
study X amount of study time) of these adjusted scores was then 
accomplished and is presented in Table 1. The Fs for both type of 
study and amount of study were found to be significant at the 01 
level. The mean scores for each cell of the design are presented in 
Table 2. 

The significant F for type of study indicates that there were 
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TABLE 1 
ANALYSIS OF VARIANCE OF ÁDJUSTED SCORES 


Source df Mean Square F ? 
Type of study 1 10,046.70 10.51 .01 
Amount of study time T 12,080.38 12.64 .01 
Interaction 1 234.00 .24 
Within 26 955.50 
Total 29 
TABLE 2 


Mean APTITUDE AND RETENTION Test Scores ror Each GROUP 


Mean 
OMS Retention 

core 
Group study Time N ndex (Unadjusted) — SD* 
Instructor 734 min. 7 7.4 9.6 5.33 
Student Dmm. 7 67 141 1.59 
Instructor 12 min. 8 6.8 13.5 2.91 
Student 12 min. 8 7.1 17.0 2.00 


* Bartlett’s chi square for these data is 7.36 (.10 > p > .05). 


real differences between the average performance scores of the 
Student and the Instructor Groups. The differences, however, 
Were not in the direction anticipated. Instead, students who studied 
using their own techniques did better on the posttest than students 
for whom study techniques were prescribed. 


. A second study was then conducted to determine whether the 
Introduction of certain changes in the procedure used with the 


Instructor Group would result in faster learning by Ss of this group 
than by Ss who studied using their own set of techniques. Two 
Changes in procedure were introduced with the Instructor Group: 
(a) prompting training was used throughout Ge., immediately 
following presentation of a stimulus-response pair, the stimulus- 
term was presented alone, S being required to respond with its 
Tesponse-term); thus, S was never tested during training, and 
(b) the items were presented in a fixed order, both within sets 
and between sets, This revised set of procedures was used with 
26 Ss. A Student Group (V = 26) received the same treatment 
as the Student Group in the first study. The Ss in each group were 
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allowed 724 minutes for study. A median test applied to the post- 
test scores of both groups resulted in a chi square of 7.69 (p « .01), 
the performance of the Student Group again exceeding that of the 
Instruetor Group. 


Discussion 


An examination of the differences in procedures used with the 
Student and the Instructor Groups might shed some light on these 
performance differences. Of particular interest is the fact that for 
members of the Student Groups, all symbols and names were 
present throughout the study period. The implications here are 
that (a) this probably allowed for more opportunity to observe 
differences between symbols and between names than was the case 
with the Instruetor Groups; and (b) students could spend more 


time studying those symbols and names with which they were 


having difficulty. 

A second difference between the groups is that members of the 
Student Groups were probably more familiar with the techniques 
which were used (and which each student selected) during the 
learning period than were students of the Instructor Groups. 
Thus, the posttest performance differences between the Student 
and Instructor groups may not have been so great had students of 
the Instructor Groups had some practice in learning how to learn 
from the techniques used during their study period. 

Finally, the manner in which the techniques were combined for 
use with the Instructor Groups may have led to slower learning 
(e.g., the relatively little emphasis on response learning). . 

At least two conclusions might appear warranted on the basis 
of the results of these studies: (a) that allowing the student to 
decide on the techniques he will use during study on tasks like 
that of learning the names for electrical symbols is good educational 
practice, and (b) that in respect to the design of materials and pro- 
cedures for learning on such tasks, the experimental psychology of 
learning has little to offer. Such conclusions do not appear tO 
present a complete picture. . 

The situation, rather, may be described like this: The effective 
design of materials and procedures for tasks even $0 apparently 
simple as learning the names of electrical symbols is not easy- 
Much information is already available from laboratory research 
on learning which might prove useful in designing materials for 
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such tasks. However, until considerable experience is accumulated 
about how such information can best be applied to the design of 
materials and procedures for learning in classroom type situations 
(or until additional research-based information is available), the 
practice of allowing the student to decide on the techniques he 
will employ during study of the kind of task used here may be a 


defensible one. 


Received December 10, 1956. 


RETROACTIVE INHIBITION AND FACILI- 
TATION IN THE LEARNING OF 
SCHOOL MATERIALS 


Ld 
Davip P. AusuBEL, LILLIAN CUKIER RonBINS, AND ErrAs BLAKE, JR. 


University of Illinois 


This study is generally concerned with some of the major deter- 
mining factors and processes involved in the retention of typical 
school materials. More specifically, it deals with (a) the relative 
importance of proactive and retroactive inhibition in the forgetting 
of such materials, (b) the role of similarity between original and 
interpolated tasks in retroactive inhibition, and (c) the relative 
facilitating effects of simple repetition and explicit retroactive 
enhancement of the discriminability of learning material on its 
subsequent retention. 

The paradigm of retroactive inhibition is congruent with the 
highly paradoxical “interference” theory of forgetting. According 
to this theory, the introduction of identical material subsequent to 
learning results in consolidation and an increment in performance 
but the more similar the subsequently introduced material is to 
the original learning task, short of complete identity, the more it 
supposedly interferes with the retention of the latter. The resulting 
decrement in performance (as well as the interfering processes 
responsible for the decrement) is designated as “retroactive inhibi- 
tion." 

Practically all of the experimental work on retroactive inhibition 
has been conducted with materials (e.g., nonsense syllables, lists 
of paired adjectives) that are quite far removed from the type © 
learning that occurs in the classroom. 'This approach has not been 
completely without rationale. First, nonsense syllables, as additive 
units of equal difficulty, make for greater quantifiability. Second; 
because they are relatively meaningless, their use makes it possible 
to rule out the indeterminable influence of meaningful antecedent 
conditions that obviously vary greatly from one individual tO 
another. Unfortunately, however, although the use of nonsense 
syllables adds undoubted methodological rigor to the study O 
retroactive inhibition, their very nature limits the applicability ° 
findings to the retention of meaningless materials—or to à type © 
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learning that is rare both in everyday situations and in the school 
room. Nevertheless, despite the absense of any convincing a priori 
grounds for believing that retroactive inhibition occurs in the same 
way for meaningful as for relatively meaningless (arbitrary, rote, 
diserete) learning, findings from the latter kind of situation have 3 
been commonly and uncritically extrapolated to the former. 
What reasons are there for suspecting that retroactive inhibition 
might operate differently for meaningful than for meaningless 
materials? First, nonsense syllables are discrete and isolated entities 
that are almost completely unrelated to the vast reservoir of con- 
cepts and information that a pupil brings with him to a learning 
situation. They are relatively unaffected by previous learnings; 
and, hence, any forgetting that results from the studying of sub- 
sequently introduced materials (over and above the forgetting of 
an equated group that “rests” between initial test and retest) can 
be plausibly attributed to the interfering influence of these latter 
materials. This is simply not the case in school learning. Almost 
any new meaningful material is related in some way to what has 
been learned in the past; in fact, the curriculum is deliberately 
organized in this fashion to provide for the gradual and untrau- 
matic introduction of new facts and concepts. Hence, it seems rea- 
sonable to suppose that any school learning will involve interaction 
with established knowledge in the learner, and that this interaction 
will probably have greater influence on retention than the immedi- 
ate interactive effects of material introduced subsequent to such 
learning. Furthermore, the retroactively introduced materials also 
have significant connections with the residue of prior learning 
experience. Thus, whatever effect they do have on retention is 
probably less a reflection of interaction with the learning task in 
their own right than of the activation and mobilization of relevant 
subsuming concepts in the individual’s existing cognitive structure. 
Second, in dealing with meaningless material that cannot be 
related to established concepts in cognitive organization, a simple 
paradigm of retroactive interference, e.g., stimulus or response 
generalization, can adequately explain forgetting. Meaningful 
materials, however, are invariably related to an existing cognitive 
Structure that is hierarchically organized in terms of highly stable 
‘and inclusive conceptual clusters under which are subsumed less 
stable and more specific illustrative data. As they enter the cogni- 
tive field, they are appropriately “catalogued” under relevant 
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conceptual foci, since this is the most orderly and efficient way of 
organizing large quantities of material for ready availability. At 
first, for a variable period of time, the recently catalogued data 
can be dissociated from their subsuming concepts and are repro- 
ducible as individually identifiable entities. Because it is most 
economical and least burdensome, however, to subsume as much 
material under as inclusive categories as possible, the import of 
many specific illustrative items is assimilated by the generalized 
meaning of the more established and highly conceptualized sub- 
suming foci. When this happens, these items are no longer dis- 
sociable or identifiable as entities in their own right and are said 
to be “forgotten.” Hence, barring repetition or some other special 
reason, e.g., primacy, uniqueness, affective content, for the per- 
petuation of dissociability strength, specific items of experience 
that are supportive of an existing conceptual entity tend eventually 
to be irreversibly subsumed by the generality of the latter. In 
other words, it is proposed that subsumability under established 
concepts, rather than retroactive interference due to similarity 
per se, is the determining factor in the forgetting of meaningful 
learning material. 

Third, similarity between original and interpolated materials 
presumably bears a different relationship to retroactive inhibition 
when the materials involved consist of conceptual data and related 
information than when they consist of rote, nonsense tasks. In 
the latter instance, specific differences in content, in the absence 
of any transferable conceptual meaning, completely obviate the 
possibility of retroactive facilitation and only generate confusion. 
It is little wonder, then, that degree of similarity (short of identity) 
between original learning task and interpolated material is pos! 
tively related to degree of retroactive inhibition. Meaningful ma 
terials, however, in addition to being characterized by specific 
detail and context, have a more general conceptual content that 
is transferable to comparable learning situations. Hence, context- 
ually dissimilar but substantively similar interpolated Jearnir 
can retroactively facilitate the retention of previously learned ma- 
terials. 

A final theoretical issue that we shall exp 
do with the mechanism of retroactive facilitation in the retention 
of meaningful material. By retroactive facilitation is meant the 
effect of interpolated material in reducing the retention loss cause 
by proactive inhibition. It is demonstrated when, after interpolate 


lore in this study has to 
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learning, significantly less loss occurs from immediate test of 
original retention to later retest than occurs when no learning task 
is interpolated between test and retest.! Consideration of the “sub- 
sumption-dissociability” theory of forgetting proposed above sug- 
gests that a given ideational element is irreversibly subsumed under 
an established concept not because of similarity between them per 
se, but because its distinguishing features are not sufficiently salient 
to enable it to persist as a discriminably distinct entity in its own 
right. Thus, it is conceivable that identical or substantive repetition 
of the learning material induces retroactive facilitation by increas- 
ing its discriminability from already established concepts. Another 
possible way of retroactively enhancing the discriminability of 
learning material is explicitly to point out similarities and differences 
between it and potentially subsuming concepts. Since this latter 
method presumably results in more explicit appreciation of the 
distinctive properties of the learning material, it should lead to 
greater retroactive facilitation than an equivalent amount of simple 


repetition. 
Hypotheses 


Proceeding from the foregoing theoretical considerations, there- 
fore, an experiment was designed to put to empirical test the fol- 
lowing hypotheses regarding the retention of meaningful school 
materials: (a) Proactive inhibition, i.e., irreversible subsumption 
of learning material by established concepts, accounts for more 
forgetting than does retroactive inhibition. (b) The interpolation 
of substantively similar (but not identical) material between initial 
test and retest results in retroactive facilitation rather than retro- 
Active inhibition. (c) Less retroactive facilitation occurs when the 
interpolated material merely recapitulates the learning passage 
than when it explicitly delineates similarities and differences be- 


tween the latter and presumed subsuming concepts. 


. Method 
Subjects 

An experimental group and three control groups were constituted 
out of a total population of 188 undergraduates in eight sections 
Se 


! If sufficient retroactive facilitation occurs between test and retest, as 
when the learning task is repeated frequently during the interpolated ses- 
sions, an actual increment rather than a reduced decrement results. 
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of a course in educational psychology at the University of Illinois. 
The mean ages of these groups varied from 21.8 to 22.4. The ex- 
periment was rationalized as a laboratory exercise in the psychology 
of learning and was performed during regularly scheduled class 
hours. In order to maximize ego involvement, the Ss were inf ormed 
that after the data were processed their individual scores, as well 
as the class results, would be reported to them. 


Learning Material and. Test of Retention 

The learning material used in this study was a passage of ap- 
proximately 1700 words on the history, sacred literature, doctrine, 
and ethical teachings of Buddhism. It was specially designed for 
use in an experimental learning situation that would bring into 
play the same kinds of cognitive variables and interactional proc- 
esses that affect the long-term retention of typical ideational ma- 
terial learned in a school setting. Hence, it had to satisfy the major 
criterion of being largely unfamiliar in its specific aspects but still 
generally related to a body of knowledge relatively well established 
in most of our Ss. On purely a priori grounds it seemed reasonable 
to assume that the facts and ideas contained in this passage would 
interact with and be subsumed under existing concepts of Chris- 
tianity, and that the latter concepts would furnish the basis for 
whatever proactive and retroactive inhibition might occur in the 
course of learning and retention. : 

A 37-item multiple-choice test was constructed to test immediate 
and later retention of the Buddhism material. Preliminary tryouts 
of the passage and the test indicated that the material was actually 
unfamiliar but learnable, and that the test was reliable and struc- 
turally sound. Scores on the test showed a satisfactory range o 
variability and were distributed normally. The corrected odd-even 
coefficient of equivalence was .81. When the test was administered 
to naive Ss who had not previously studied the passage, the mean 
score (number of correct items) was only 8.9 as compared to ? 
chance expectancy score of 7.4 (one-fifth of 37). 

Because of the limited duration of class periods, the passage 
could not be made long enough to provide à sufficient number 9 
conceptual items for two reasonably adequate tests. Practical con^ 


> M > sa 
siderations, therefore, made it necessary to use the same test a ^ 
measure of both immediate and later retention. Hence, it 25 an e 

ted in som 


ceivable that memory of answers given initially compe 
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instances on the retest with the Ss’ recollections of the material 
itself. The probability of this happening, however, was minimized 
by the eight-day interval between tests and the difficulty of specif- 
ically remembering one of five highly similar and complexly worded 
alternatives. In any event, this factor operated in similar ways for 
the experimental and control groups. 


Experimental Procedure 

The Ss in all groups were first instructed to read the Buddhism 
passage at their normal reading speed (eight to ten minutes) and 
then to utilize the remainder of the 25 available minutes to com- 
mit the concepts and faets in the passage to memory. It was ex- 
plained that a multiple-choice test would be given immediately 
after the reading was completed and also eight days later. Both 
tests were given as announced. 

Twenty-four hours after the initial test, 20-minute sessions of 
interpolated learning were introduced as follows: The Experimental 
Group studied a 2100 word comparative essay on Buddhism and 
Christianity. This essay encompassed all of the material included 
in the Buddhism passage, but similarities and differences between 
the two religions were explicitly pointed out. Control Group A 
restudied the Buddhism passage. Control Group B studied a 1700 
word passage on the history, sacred literature, doctrine, and ethical 
teachings of Christianity. In each instance it was explained that 
knowledge of the material studied during the interpolated session 
would be tested for a week later by the previously announced 
retest. Control Group C was not subjected to an interpolated learn- 


ing session. 
Results and Interpretation 

Means and standard deviations of all groups on the initial test 
and retest are shown in Table 1. Since raw scores on this multiple- 
choice test include correct answers attributable to guessing, inci- 
dental knowledge, and the presence of unintentional cues in the 
five alternative answers, corrected means were computed by sub- 
tracting from the raw means the mean score of Ss who took the 
test without studying the Buddhism passage. These corrected 
Means were utilized in making all intragroup and intergroup com- 
parisons of test scores. The distribution of test scores did not di- 
verge significantly from the normal. F tests for homogeneity of 
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TABLE 1 


MEANS AND STANDARD DEVIATIONS OF IMMEDIATE AND LATER TESTS OF 
RETENTION FOR EXPERIMENTAL AND CONTROL GROUPS 


Percent- 


Test I Test II age Re- 
tained 

Interpolated Raw Corr. Raw  Corr.^ (Test IT/ 

Group Material N Mean Mean? SD Mean Mean SD Test 1) 
Experimental Comparative 43 24.4 15.5 5.47 25.6 16.7 5.25 107.7 
Control A Identical 50 22.5 13.6 6.14 23.2 14.3 5.75 105.1 
Control B Subsuming 44 24.7 15.8 4.79 21.9 13.0 4.83 82.3 
Control C None 51 24.5 15.6 5.84 22.0 13.1 6.13 84.0 


"Corrected means were computed by subtracting 8.9 (the mean score of Ss who took 
the test without studying the learning material) from the raw means. 
‘Corrected means were utilized in computing the percentage retained from test to 


retest. 
variance between groups indicated that we were dealing with com- 
parable groups (i.e., drawn from the same population). 

The relatively high degree of retention (8496) demonstrated by 
Control Group C after an eight-day interval without any inter- 
vening repetition can be attributed to several factors. First, the 
gradient of forgetting is known to be very gradual when meaningful 
material is memorized. Second, the availability of memory traces is 
maximized when a recognition rather than a recall method of re- 
tention is employed. Third, a type of reminiscence effect, was par- 
tially operative as frequently becomes evident when immediate 
retention of partially learned, highly conceptualized material 19 
compared to retention several days later. Dissociability of newly 
subsumed materials presumably tends to be at a minimal leve 
immediately after the initial shock of interaction with their sub- 
suming concepts, and improves spontaneously for a limited time 
thereafter as the temporary confusion generated by the interaction 
gradually dissipates. Thus, although the eight-day interval in this 
instance undoubtedly exceeded the temporal limits of the spo?” 
taneous recovery phase, the ratio of later to immediate retention 
was probably still inflated by the temporary inhibition of retention 
immediately after the completion of learning. 


Proactive versus Retroactive Inhibition 


To test the hypothesis that most forgetting of meaningful 
material is reflective of proactive rather than retroactive inhibitioD; 
the retest scores and percentages of retention (Test II/Test Do 
control groups B and C were compared (see Table 1). Contro 


Group B had studied an interpolated passage on Christianity, 16 
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the material potentially most capable of inducing retroactive in- 
hibition in relation to the Buddhism learning passage. Control 
Group C had not been exposed to an interpolated learning session. 
Thus, in the first instance, both retroactive and proactive inhibition 
were operative, whereas in the second instance only proactive in- 
hibition was operative. The data show conclusively that subsequent 
study of relevant subsuming material results in only negligibly more 
forgetting of the original learning material than when the latter 
material interacts spontaneously with established subsuming con- 
cepts that the individual brings with him to the learning situation. 
Since most ideational material that is learned in a school setting is 
related to and hence potentially subsumable by an existing body 
of stable concepts in the learner’s cognitive organization, this find- 
ing is generally applicable to school learning. It suggests that 
retroactive inhibition is an important determining factor in for- 
getting only in artificial laboratory situations employing nonsense 
learning tasks, and is at most a relatively minor variable in the 
retention of most newly learned school materials. 


The Role of Similarity in Retroactive Inhibition 

That similarity short of identity between learning and inter- 
. polated tasks does not cause retroactive inhibition in the case of 

meaningful material, but may or may not induce retroactive 
facilitation, is evident from intergroup comparisons of the retention 
data of the Experimental Group and Control Groups B and C 
(see Table 1). This type of analysis supports the view that the 
influence of interpolated learning on retention is & function of 
whether or not it increases the discriminability of the original 
learning material and thereby counteracts irreversible subsumption 
of the latter by established concepts. As already pointed out, al- 
though interpolation of similar subsuming material (Christianity 
passage) apparently does not induce retroactive inhibition, neither 
does it reduce proactive inhibition. A t test between the retest 
scores (or the percentages of retention) of the Experimental Group 
and Control Group C, on the other hand, shows that the inter- 
polation of similar comparative material results in a highly signifi- 
cant amount of retroactive facilitation (p < .001). Despite the 
seven-day interval between interpolation and retest, slightly more 
(7.7%) material was remembered on Test II than on Test I. The 
net conceptual residue of the comparative passage, which consisted 
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of recapitulated Buddhism material, subsuming Christianity ma- 
terial, and an explicit comparison between the two religions, un- 
equivocally increased the discriminability of the original Buddhism 
material. It appears, therefore, that in dealing with meaningful 
school learning, the absence of point-for-point-identity between 
original and interpolated learning tasks not only fails to induce 
retroactive inhibition but is also compatible with the occurrence 
of retroactive facilitation as long as the interpolated material en- 
hances the discriminability of the learning task. 


Repitition versus Explicit Comparison 


To what can the facilitating effects of the interpolated compara- 
tive passage be attributed? One possible explanation, emphasized 
in our theoretical discussion above, is that explicit delineation of 
the similarities and differences between the learning passage (Bud- 
dhism) and its presumed subsuming concepts (Christianity) €n- 
hanced the discriminability of the former and made it more resistive 
to irreversible subsumption by the latter. However, our experimen- 
tal design had to take into account the obvious fact that a compara- 
tive presentation must of necessity recapitulate in a different context 
the very same ground covered by the original learning material. 
It is patently impossible to compare Buddhism to Christianity 
without discussing Buddhism. Hence, it could be plausibly argue 
that the retroactive facilitation following interpolation of the cont 
parative passage was largely induced by the recapitulative features 
of the latter material; and since there was, in fact, no significant 
difference between the Experimental Group (comparative inter- 
polation) and Control Group A (repetitive interpolation) in toe 
ratio of later to immediate retention scores (see Table 1), this 
argument appears incontrovertible. It must be concluded, there- 
fore, (a) that most if not all of the retroactive facilitation associate 
with the interpolation of the comparative passage can be attribute 
to substantive repetition of the learning passage in another contexts 
and (b) that retroactive use of comparative material is no more 
efficacious in counteracting proactive inhibition than simple a 
capitulation of the learning material itself. These conclusions, 9 
course, do not detract from the finding that, where meaning!" 
learning is involved, interpolation of nonidentical material not 


only does not necessarily result in retroactive inhibition but may 
also induce retroactive facilitation. 
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The failure of the comparative approach to reduce proactive in- 
hibition any more than was accomplished by simple repetition can 
be attributed perhaps to the fact that this method may only be 
effective in preventing irreversible subsumption caused by initial 
lack of discriminability. Once material, by virtue of such initial 
limitations in discriminability, can no longer be dissociated from 
its subsuming concepts at the time of original learning, repetition 
may be the only possible way of retroactively enhancing discrim- 
inability. This explanation is being tested presently by means of a 
research design which contrasts the retention of Ss initially learning 
the comparative passage with the retention of matched Ss initially 


learning the Buddhism passage. 


Summary and Conclusions 


Retroactive inhibition and facilitation in the retention of mean- 
ingful school materials was studied in a population of 188 under- 
graduates divided into an experimental and three control groups. 
‘All Ss studied a 1700-word passage on Buddhism and were tested 
multiple-choice examination for retention of same 
and eight days later. Interpolated learn- 
24 hours after original learning and 
Experimental Group learned a passage 
explicitly comparing Buddhism and Christianity. One control group 
restudied the Buddhism passage, and another control group studied 
a passage dealing solely with Christianity. The remaining control 
group was not subjected to any interpolated learning. 

Intergroup comparisons of the ratios between initial and later 
retention supported the hypotheses that, where meaningful school 
material is involved, (@) proactive rather than retroactive inhibition 
is the determining factor in forgetting; and (b) interpolated learning 
of material substantively similar to but not identical with the origi- 
nal learning task does not result in retroactive inhibition, but 
induces as much retroactive facilitation as identical repetition 
of the learning material. Explicit comparison of the learning ma- 
terial and its presumed subsuming concepts was: Ro more effective 
than simple repetition of the learning material in inducing retro- 
he findings were interpreted in the light of a 


active facilitation. TI ; 
“subsumption-dissociability” theory of retention and forgetting. 


by means of a 
immediately after learning 
ing sessions were instituted 
werearranged as follows: The 
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THE IDENTIFICATION OF NUMBER ANXIETY 
IN A COLLEGE POPULATION 


RALPH Mason DREGER 

Jacksonville University 

Lewis R. AIKEN, JR. 
Emory University 


Many persons report in clinical sessions and in academic classes 
that they are emotionally disturbed in the presence of mathemat- 
ies. It has been the writers’ experience that reactions of this 

nature to mathematical and arithmetical materials constitute & 
major field of emotionality in academic situations. And yet, al- 
though many studies have been conducted to determine optimum 
ways of presenting arithmetic materials (7), almost no controlled 
research has been attempted in the realm of emotional problems 
associated with arithmetic and mathematics. Schonell (12) ob- 
served that backwardness in arithmetic is due as much to emo- 
tional as to intellectual factors, and psychoanalytic writers (6, 8, 
11) suggest that failure in arithmetic may be related to maternal 
overprotection. . 

Inasmuch as there has not been any attempt to determine if 
actually there is a set of reactions to numbers and mathematics 
on the scale that appeared evident to the writers, an inquiry was 
called for to investigate the extent and nature of such -—- 
Consequently, the Study reported here is an endeavor to detec 
the presence of a Syndrome of emotional reactions to arithmet? 
and mathematics, tentatively designated “Number Anxiety.” We 
Speak of a syndrome, for anxiety is probably more accurately de- 
scribed as “anxieties” rather than “anxiety” (2). The existence © 
Specific situational anxieties has been shown in several ne ot 
tions (8, 13). Moreover, O'Connor's (10) finding of five am 
factors 9n the Taylor Scale of Manifest Anxiety shows that — 
ensions of anxiety are measured by this popular scale. Numbe 
Anxiety, too, may turn out to be a complex variable. 


Hypotheses Vds 
formulated on the basis of previo 


: its 
existence of Number Anxiety and ! 
uman functioning: 
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Three hypotheses were 
observation concerning the 
relations to Concomitant hi 
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1. Number Anxiety exists as a factor distinct from (although it 
may accompany) “general anxiety.” 

2. Number Anxiety is not related to intelligence as measured 
by the Wechsler-Bellevue or ACE scales, although it is probably 
negatively correlated with ACE Q scores. 

3. Persons with high Number Anxiety will tend to make lower 
scores in mathematics than others, even though intelligence (as 
measured above) between these two groups is not significantly 


different. 
Subjects 


Seven hundred and four students in basic mathematics classes 
at Florida State University. 


Procedure 


The Taylor Manifest Anxiety Scale (14), omitting three items 
of relatively low validity and including three items (Table 1) 
specifically designed to measure feelings of anxiety concerning 
working with numbers, was administered to the 704 students at 
the close of class periods during which mathematics tests had been 
returned to them. The Ss were informed that the results of the 
Taylor Scale, called the “Confidential Data Sheet," would be 
used for scientific purposes and would have no influence on their 
grades or careers. 

Scores on the three Number Anxiety items were correlated 
separately and together with the total score on the 47-item Taylor 
scale. In addition, intercorrelations were obtained among the 
three Number Anxiety items and the 14 items on the Taylor scale 
shown in two previous studies (1, 5) to be the most valid, making 
a 17 X 17 correlation matrix. This matrix was subjected to a 
Holzinger and Harman cluster analysis (4). 

According to total scores on the three-item Number Anxiety 
items and the 47-item Taylor Scale, the 704 Ss were divided into 
four groups, those high in both “general anxiety" and Number 
Anxiety, those high in one and low on the other and vice versa, 
and those low in both. From each of these four groups ten Ss 
were chosen by a table of random numbers for further investiga- 
tion. Inasmuch as males and females were not significantly differ- 
ent in scores on either scale, the sex variable was randomized. The 
40 students in the subsample were not informed of the basis of 
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selection, and were presumably ignorant of the connection between 
the administration of the “Confidential Data Sheet” and the fol- 
lowing procedures. The Ss in the subsample were given Form I of 
the Wechsler-Bellevue Intelligence Scale (15) by one examiner. At 
the same time as the administration of the verbal subtests, pre- 
and poststimulus readings were taken on a psychogalvanometer 
registering palmar skin resistance (9). Besides the recording of 
GSR deflections for each subtest item, the deflection was noted 
for the arithmetic subtest instructions, “I want to see how good 
you are in arithmetic.” . 
The final Mathematics 105 grades for the 704 students and their 
American Council on Education Psychological Examination scores, 
Q, L, and Total, were obtained and correlated with Number 
Anxiety scores. Correlations were also found comparing grades, 
ACE, and Wechsler-Bellevue Verbal, Performance, and Full 


Scale IQ's with Number Anxiety scores for the 40 Ss investigated 
more intensively. 


Results 


Number Anxiety as defined by the three scale items (Table 1) 
proved to be present in approximately one-third of the 704 stu- 
dents; 239 or 35% responded Positively to two or three of the key 
items. 

With some minor differences, scores on the 47-item Taylor im 
formed a distribution very similar to that of Taylor's origina 
Score distribution. Like the original distribution, the one obtained 
in this study is positively skewed; it was assumed, however, that 
it probably represents a normal distribution sufficiently well to 
allow correlations which employ a dichotomy of scores. Correla- 


TABLE 1 T 
- ” [TEM 
Biser, CorRELATIONS BETWEEN THREE “NuMBER Anxiety” IT 
AND TOTAL SCORE on Tayror Scare Spurr ar MEAN 


Correlation with 


Item Taylor Scale 
" ue often nervous when I have to do arithmetic. .32 
‘Many times When I see a math problem I just 37 
reeze up,” 
m never as good in math as in other subjects. .09 
Total, item: 


8 3, 9, 38 combined = 


— 
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TABLE 2 


FACTOR LOADINGS IN “LITTLE FACTOR ANALYSIS" OF 
THREE “NUMBER ANXIETY” ITEMS 


Factors 


Item A B 
3 —.002 .552 
9 .266 .324 

38 -518 -001 


tions between the Taylor scale and the Number Anxiety items, 
individually and in toto, are presented in Table 1. 

The relatively high correlations among the Number Anxiety 
items (Table 3) led the writers to suppose that they represent a 
cluster or syndrome; but the fact that they are not any more highly 
intercorrelated further suggested that the three items are not 
factorially pure. With the full recognition that a factor analysis 
of three items is purely an exploratory measure, a factor analysis, 
which was termed a “little factor analysis” to distinguish it from 
a serious one, was performed employing only the Number Anxiety 
items 3, 9, and 38, by means of Thurstone’s centroid method. Two 
factors accounting for all of the common factor variance emerged. 
Orthogonal rotation allowed for no psychological meaningfulness, 
but oblique rotation to simple structure (as far as the latter can 
be obtained with three items) yielded greater significance (see 
Table 2). Factor A was identified very tentatively as “Negative 
math. reaction” and Factor B as “Nervousness in the presence of 
math.” . 

A cluster analysis (4) of the correlation matrix (Table 3) of 14 
Taylor Scale and three Number Anxiety items revealed three 
distinet clusters, tentatively labelled: I. Compulsive inattentive- 
ness (composed of items 6 and 41 from the Taylor Seale); II. 
Covert, conscious strain (items 47, 45, 42, 48, 40, 44, 21, 5, 27, 29, 
49, and 34); and III. Nervousness and difficulty in arithmetic 
(items 3, 9, and 38). The second cluster approaches “general 
anxiety" and the third clearly represents what is called here 
Number Anxiety. ; 

Log conductance changes of the GSR (9) subjected to analysis 
of variance revealed that only on the arithmetic subtest and 
arithmetic instructions were there significant F ratios (at the .05 


348 Ralph M. Dreger and Lewis R. Aiken, Jr. 


TABLE 3 


Terracnoric CORRELATIONS AMONG 17 ITEMS FROM 
CONFIDENTIAL Data SHEET 


5 |6 |9 |21|27|29|34| 38 40 41 42 |44| 45 | 47] 48 | 49 
3 |.41).10).67|.37|.33].241.12! .50| .90| .03| .14|.28| .32.35 .31 a 
5 -44).37/.70).59].29].37| .09| .52| .30| .49].52| .67|.50 -54 fee 
6 -14)..26).28).32).31] .02| .40| .s2| .40|.31 .50|.51| .38 p 
9 -33..86..16..22! .64| .21| .18 .17].06| .24|.19| .24 Ee 

21 37.29.53. .27| .13| .31| .40].50| .71.34| .60 [^ 
27 :43.38  .59| .45| .99| .56..59| .62).55 55 e 
29 16 | .08| .30| .26| .32).47| .47|.38| .28 ES 
34 [-.09 .46| .37| .36.41 | .31.39| .29 |. " 
38 —.01|— .08|— .03|.01|— .02].14|— .01 r^ 
40 -48| .47|.35| .48|.57| .68 i 
41 .48..30| .48|.52| .47 9 
42 .47| .50.58| .45 |.38 
bis .58.37, .51 |.13 
n „58| .52 |.97 
pi m 
48 . 
|t. jl | 1j JJ]. 14 


Anxiety and « 
low scorers on 
significant, eve 
of the arithme 
that all of the 
and low scorer: 

Correlation 
Mathematics 


TABLE 4 


NAL 
TWEEN “NUMBER ANxiIETY" AND Fi 
WrcHSLER-BELLEVUE IQs, AND 


Biserrat CORRELATIONS Bg 
GRADES IN MarTa, 


ACE Scores 
No. of Wechsler-Bellevue ACE - 
Subjects Grades PS Vs FS Q E 
40 — —.5 


—.04 
-20 — —.015 — 44 —.25 —.08 
70. ag 9 
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original sample, together with the correlations of Wechsler- 
Bellevue and ACE tests with Number Anxiety, are presented in 
Table 4. The only correlations which are significant are those 
between grades and Number Anxiety. 


Discussion 


Several lines of evidence lead to the position that probably 
there is an overall dimension that may be termed Number Anxiety 
which differs from “general anxiety.” The low correlations of the 
Taylor scale scores with the individual Number Anxiety items as 
well as with the total of the latter is the first piece of evidence. 
The second is the result of the cluster analysis which sharply 
separates Number Anxiety from the two “general anxiety” clus- 
ters. The third line of evidence is the highly significant changes 
which occurred in galvanic skin resistance for “number anxious" 
persons on the arithmetic instructions and arithmetic subtest of 
the Wechsler, changes which did not occur for the “general anxiety” 
group. 

Although the identification of this overall dimension seems 
the actual unidimensionality of Number Anxiety is 
d. In fact, there is reason to believe 
the surface factor isolated 


fairly secure, 
not assured by the study at han 
that there are at least two subfactors in 
here, the two brought to light by the “little factor analysis” of 
Number Anxiety items. A further analysis of the three latter 
items in the matrix of 17 items is in progress. Also, an analysis of 
further items designed to ferret out Number Anxiety is being 
conducted with the 40 Ss who constituted the original subsample. 


Nothing in the present study yields bases for determining the 


etiology of Number Anxiety. From previous thought and spon- 
taneous remarks by Ss of the subsample, a number of hypotheses 
have been formulated as to the origin of this type of anxiety which 
Ss find very handicapping. And very tentatively in the continuing 
research we are putting some of these hypotheses to the test. 
However, more work is needed to identify just exactly what 
Number Anxiety is and what differentiates it from other kinds of 
anxiety before much exact research can be done in determining 
etiology. One hypothesis which is a partially etiological one, that 
s Jy the result of low intelligence, seems to 


Number Anxiety is mere i 
be set aside by the research already done, in that neither the 
Wechsler nor ACE showed any significant, relation to Number 
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Anxiety. That actual accomplishment in mathematics is MEE 
with lack of Number Anxiety is demonstrated by we: pres -- 
research, though how much causation is involved is a sai - 
one may cause the other, vice versa, or both may be the resulta 
er factors. . 
E ae caution must be voiced in connection with the changes in 
GSR and Number Anxiety. These variables are associated, but the 
GSR cannot be strictly considered to be a measure of Number 
Anxiety, for other conditions of stress have also been shown to 
have GSR correlates. Apparently the Wechsler Verbal Scale is 
not sufficiently stressful to generate significant GSR changes 


p ; ; ; — ith- 
except ín connection with “number anxious” persons and Ari 
metic. 


Conclusions 
All three of the pr 
ported by the researc 


1. Number Anxiety does appear to be a separate Gosen 
factor from “general anxiety,” although the correlation of D a 
tween the Taylor Scale and the three Number Anxiety items 


s ds se. two 
indicates that some relation probably holds between these t" 
gross syndromes, 


2. Number Anxiet 


oposed hypotheses were substantially sup- 
h described here. 


Y does not seem to be related to ur 
intelligence as measured by either the Wechsler-Bellevue oF lated 
Scales. The Supposition that Q scores on the ACE are adn wn 
negatively with Number Anxiety must wait further research, 
the r of — 25 does not reach significance. lower 
3. Persons with high Number Anxiety do tend to make 704 
mathematics grades, as indicated by the r’s of —.44 for the 


iety 
Ss and ~.51 for the 40 subsample group between Number Anx! 
and grades in Mathematies 105. 


References -" 
i xie 
1. Buss, A. ALA follow-up item analysis of the Taylor Manifest An 
Scale. J. clin, Psychol., 1955, 11, 409-410. -— 
2, Cattell, - B. The chief invariant psychological and psye “er tt, 
functiona] unities found by P-technique. J. clin. Psychol., 
tirs i » 
d t x ety 
5, Reda, Ri, de Sarason, S. B. The relationship between "test anxi 
and other anxieties, J. Pers. 1955, 23, 317-323. 


15. 


. Silverman, R. E., & Blitz, B. Learning an 


. Taylor, Janet A. A personality scale o 


Identification of Number Anxiety 351 


. Holzinger, K. Jr., & Harman, H. H. Factor Analysis, a synthesis of fac- 


torial methods. Chicago: Univer. of Chicago Press, 1941. 


. Hoyt, D. P., & Magoon, T. M. A validation study of the Taylor Mani- 


fest Anxiety Scale. J. clin. Psychol., 1954, 10, 357-361. 


. Klein, E. Psychoanalytic aspects of school problems. In A. Freud, 


H. Hartmann, & E. Kris (Eds.), Psychoanalytic study of the child. Vol. 
III-IV. New York: International Universities Press, 1949. Pp. 369- 


390. 
Kramer, G. A. The effect of certain factors in the verbal arithmetic problem 


upon children's success in the solution. Baltimore: Johns Hopkins Press, 


1933. 


. Levy, D. M. Maternal overprotection. New York: Columbia Univer. 


Press, 1947. 


. Nichols, R. C., & Daroge, T. An electronic circuit for the measurement 


of the Galvanie Skin Response. Amer. J. Psychol., 1955, 58, 455-462. 


. O'Connor, J. P., Lorr, M., & Stafford, J. W. Some patterns of manifest 


anxiety. J. clin. Psychol., 1956, 12, 160-163. 


. Plank, Emma N., & Plank, R. Emotional components in arithmetical 


learning as seen through autobiographies. In A. Freud, H. Hartmann, & 
E. Kris (Eds.), Psychoanalytic study of the child. Vol. IX. New York: 
International Universities Press, 1952. 


. Schonell, F. J. Diagnosis of individual difficulties in arithmetic. Edin- 


burgh: Oliver and Boyd, 1937. 
d two kinds of anxiety. J. 


abnorm. soc. Psychol., 1956, 52, 301-303. 
f manifest, anxiety. J. abnorm. 


soc. Psychol., 1953, 48, 285-290. 
Wechsler, D. T'he measurement of adult int 
and Wilkins, 1944. 


clligence. Baltimore: Williams 


Received February 13, 1957. 


THE RELATION OF DEPARTURE TIME AND 
RETENTION TO ACADEMIC ACHIEVEMENT 


ABRAM M. BARCH 


Michigan State University! 


For over three decades there has been a slight but constant 
interest in the study of persistence and its relationship to achieve- 
ment, especially academic achievement. Despite this interest, 
several promising suggestions and studies have not been followed 
up. Thornton (5) hypothesized that the more the test of persistence 
approximates the school situation, the higher would be the corre- 
lation between persistence and school success. Briggs and Johnson 
(2) found that a persistence measure obtained in the achievement 
situation itself (namely, time to complete an achievement test 
given without a time limit) was related to achievement when 
general verbal ability was taken into account. French (3) devel- 
oped a short test of persistence which he found resulted in a? 
improvement in Prediction of academic success above that ob- 
tained by conventional aptitude tests. His test consisted of 30 
number series problems given with the instructions that some 
problems were insoluble and Were to be so marked and was score 
In terms of amount of time spent on the test. 


Problem 


The present study had two objectives. The first objective ven 
to repeat the Briggs-Johnson study in a situation where mea 
information would be available about the aptitudes of the studen e 
and the characteristics of the achievement test items and to ee 
if a better estimate of the predictive value of their persisten 


k r- 
measure could not be obtained. The second objective was to ed 
mine the effect on achievement test; score of requiring all stude 
to remain i 


n the testing situation for a period of time distinctly 
longer than most students would stay without restraint. ae 
e assumption behind the second objective was that <8 
merely spending more time in the testing situation, irrespect 
a. 
The author wishes 


ichigan State 
: to thank the Board of Examiners of Michigan 
Versity for the ass: 


istance rendered in carrying out this study. 
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of the reason(s) for spending this additional time, might be the 
factor responsible for increased achievement. The particular 
method selected for retaining students in the testing situation 
beyond their usual departure time was chosen to interfere least 
with the normal procedure of achievement testing. In addition to 
this aspect of minimal interference, the reason given for the deten- 
tion was chosen so as to be irrevelant (or neutral) to any given 
person’s level of persistence or achievement aspiration level. 


Procedure 


Achievement test 

At Michigan State University, all students are required to take 
a one-year sequence in natural science unless they have had courses 
in that area at other institutions or unless they can pass special 
qualifying examinations in the area. The third term of this course 
is concerned with the atomic-molecular theory and the related 
concept of energy. It includes laboratory exercises on gas laws, 
atomic theory and electricity, and the nature of matter. 

The final examination of the course, new each term, is prepared 
by a group of course instructors working in conjunction with 
members of the Board of Examiners. The test consists of 120 
multiple-choice five-alternative items. The test items are intended 
to measure comprehension rather than knowledge of specifies. 
Classification of test items in the examination used in this study 
(spring of 1955) according to categories based on the Taxonomy 
of Educational Objectives (1) indicated that the test contained 8 
items on knowledge of specifies, 38 items on knowledge of ways 
and means of dealing with specifics, 45 items concerned with 
ability to make use of materials or ideas through interpretation, 14 
with the use of abstractions in particular and 


items concerned : 
rned with the analysis of 


concrete situations, and 15 items conce 
relationships.” 
Procedure 

The final examination was given to all students in the course at 
the same time in the same building. Instructional classes were 
grouped for examination purposes. The number of students per 
room when tested varied but averaged about 60. 


2 Clarence Nelson performed the classifications. ... 
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Subjects were students in two sets of six classrooms ee so 
selected that no student was in an examination room with a 
proctor who was also his natural Science instructor. In all rooms 
distribution of the tests was supposed to begin at 10:10 a.m. . 

In six of the rooms there was no change from normal am 
procedure. The proctors merely recorded the departure time o 
Students in units of five minutes each from the time the tests were 
distributed. The first Student left just less than an hour after the 
test began. The testing session was terminated at 12:30 or shortly 


thereafter. The students in these rooms formed the initial basic 
group. 

In the other six testi 
write the followin 
12:00. This is ne 
early. 


ng rooms, the proctors were requested E 
g on the blackboard: Please remain seated a 
cessary to reduce disturbance by those finishing 


Analysis 


In order to obtain a more homogeneous group with zanen 
educational background, only those students who had entered on 
University in fall, 1955, and who had taken the entire orienta e 
test battery were retained for further study. This group inclu en. 
& small proportion of transfer Students in addition to ient 

he remaining basic groups were composed of 190 men ee 134 
Women. The remaining experimental groups were composed o tely 
men and 48 Women. Men and women were treated separa’ res 

ecause intercorrelations between tests and achievement measu ep 
had been found to be related to sex at M.S.U. with the wom 
usually Showing higher intercorrelations than men. were 

^, * aptitude tests in the orientation test battery that ACE 
Utilized in thi Study included the L and Q portions of the A lly 
(1940 Version), a locally prepared Reading Test (RT) and a loca ns 
Prepared Arit etic Proficiency Test (A). The intercorrelatio 5 
of these tests and the predictive value of the RT and the A T 
large group or MSU. reshmen has previously been published O 

he RT was the best single predictor of first term grade por 
average, 
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Results 


Significance of departure time 

Inspection of the correlational plots relating departure time to 
the other measures indicated that the relationships could be as- 
sumed to be linear. This is in contrast to the results obtained by 
Briggs-Johnson who generally found a U-shaped trend between 
departure time and achievement. In the present study, the longer 
the students stayed the better was their achievement score. 

Table 1 gives the product moment intercorrelations for natural 
science final (NS), departure time (DT), RT, A, L and Q. Values 
for men appear above the diagonal; values for women are below 
the diagonal. The relationship between DT and NS is low. This 
was expected since verbal ability was expected to obscure the 
relationship between these two measures; i.e., leaving at a given 
time after the beginning of the test means one thing when done 
by a person with good verbal ability and something else when 
done by a person with low verbal ability. 

It will be noted that DT correlated negatively with all other 
measures except NS and A for both men and women. The positive 
relationship with A and the tendency for A and DT to include 
similar variance in multiple combinations was somewhat puzzling. 
Further investigation suggested the apparent reason. A was given 
with such a liberal time limit as to be essentially an untimed test, 
and the scores on this test apparently included some variance due 


to this untimed aspect. 


TABLE 1 
Tesr Scorns, DEPARTURE Timp 


INTERCORRELATIONS OF ORIENTATION 3 
AND NATURAL BCcrENCE FINAL SconE' 


1. Natural Science 
Final Sos d E c8 


2. Departure Time 
3. M.S.U. Reading 44  —22 .25 63 29 
Test 
4. M.S.U. Arithme- .51 .13 .35 N 45 
tic Test 
5. L scale, ACE .50 —-10 E ih .88 "m 
.32  —.15 -AT  .08 .45 


6. Q scale, ACE 
a Men (190) above diagonal; women (108) below diagonal, 
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TABLE 2 
AL 
PARTIAL CORRELATION COEFFICIENTS BETWEEN NATURAL SCIENCE FIN 
Score AND DEPARTURE TIME 


Comparisons Men (190) Women (108) 
T 12.3 .23** .27** 
T4 .13 pin 
Tiu .18* .20 
Ti2.8 Met .22* 
712.6 .20** .20* 
712.56 .25** .24* 


* .05 level of significance, 
** 01 level of significance. 
"See Table I for names of variables. 


Table 2 gives the first and second order partial correlation €O- 
efficients between the various predictor measures and NS. = 
partials are significantly different from zero at the .05 level io 
better except the first order partials with A as the partialle 
variable. naa BB 

Another way of evaluating the importance of departure tinea 
a measure for predicting academic achievement is to determi le 
the extent to which it increases the magnitude of a mon 
correlation. Table 3 lists the R’s obtained for combinations pes 
predictor variables with and without DT when NS and a 
point average for first M.S.U. year were the criterion variab e age 

Some loss in cases occurred in the group with grade point aver : 
as the criterion. Excluded from this group were students less 
not return to school the next fall term and students who ha asd 
than 32 credits or more than 63 credits at the end of their men 
M.S.U. year. The remaining groups were composed of 146 
and 90 women. dictor 

In all comparisons except one, addition of DT to the pre! el or 
variables significantly increased the size of R at the 05 
better. The one exception which involved the prediction Q nifi- 
for women using RT and A as predictors was borderline to EE F 
cance at the .05 level. The increase in predictive variance enn 
cases was small, but it must be remembered that ee 
is essentially one response. The results suggest the value of © 


i tia b te aca- 
ing additional measures of this type for use in prediction » 
demic achievement, 
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TABLE 3 


MULTIPLE CORRELATIONS PREDICTING NATURAL Scrence FINAL SCORE 
on First Year GRADE POINT AVERAGE WITH DEPARTURE TIME 
EXCLUDED AND INCLUDED 


Criterion Variable Predictor Variables R’sfor Men — R's for Women 
Natural Science 35, 4 45 -58 
Final Score 2, 3, 4 48 .60 
(Men = 190) 5, 6 .36 .51 
(Women = 108) 2, 5, 6 43 55 
First Year Grade 3,4 .36 .73 
Point Average 2, 3, 4 .40 ate 
(Men = 146) 5, 6 .89 .53 
(Women = 90) 2, 5, 6 .45 .57 


a See Table 1 for names of variables. 


Sex differences 


The mean NS score for men (basic group) was 61.4 and that 


for women (basic group) was 56.5. The difference was significant 
at the .01 level. Men and women were not significantly different 
in DT, RT, A or L. The men scored higher (.01 level) on the Q 


scale than did the women. 
The point of interest in t 
differ in terms of test persis 
is due to the nature of the tes 
would obtain in a situation where the w 


test achievement. 


hese results is that the sexes did not 
tence. It is possible that this finding 
t and that somewhat different results 
omen would be superior in 


Effect of Retention 
The results of the basic groups indicated that 74 % of the women 
and 85% of the men could have been expected to leave by 12:00, 
the end of the retention period for the experimental groups. An 
analysis of covariance Was performed for men and women sepa- 
rately using RT score as the control variable. The F in both com- 
parisons failed to reach the 05 level. Thus, no evidence was ob- 
tained in support of the hypothesis that merely spending more 
time in the testing situation would lead to better achievement. It 
e manner of retention was designed to 


should be recalled that th n 
provide minimal interference with normal testing procedure and 


the reason given for the retention was neutral with respect to a 
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T " ext 
given person's level of persistence or aspiration lev êl; beige 
question to be asked is whether instructions stressing the i P 
tance of taking more time on the test will affect achievement. 


Conclusions 


ts 
1. Persistence as measured by time taken by college een 
to complete an achievement test was found to be signi vate 
related to test achievement, and grade point average when 
ability was partialled out. ] sjo 
2. The persistence measure significantly improved multip 
regression predictions for academic achievement. 


: in test 
3. No difference was found between men and women ir 
persistence. 


4. Retaining studen 
Which were neutra] w 
achievement, 


: f ions 
ts in the testing situation with eh 
ith respect to achievement had no affe 
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DIFFERENTIATING CHARACTERISTICS OF 
A GROUP OF STUDENTS HAVING 
PSYCHOLOGICAL 
PROBLEMS 


Grorce DovGias Maro 


Naval Air Technical Training Command, Memphis! 


Virtually everyone associated with an educational program has 
had occasion to observe students with psychological problems and, 
frequently, to inquire whether or not the school situation is in 
some degree responsible for the condition observed in the student. 
Rotter (5, pp. 416-423) has discussed this matter in an astute 
manner and has attempted to treat it within the general frame- 
work of his social learning theory. In addition to the orientation 
described in the following paragraphs, a section of the present 
paper has been devoted to consideration of the findings of the 
study in terms of Hull’s system (3) as adapted to emotional be- 
havior by Brown and Farber (1). 

It is important from the practical point of view to know whether 
psychological problems tend to be more closely associated with 
factors in the immediate environment or more closely associated 
with factors more remote from the current situation. If it is known 
that maladjustment is associated primarily with factors in the 
immediate environment, it may be possible to change these factors 
in such a way as to alleviate the condition. If not, it is necessary 
to look to other, probably more difficult, remedial procedures. 

The study was conducted within this general framework of 
situational versus nonsituational causation of psychological 
problems and disorders. But no claim can be made that it provides 
definitive results concerning this general problem. In this connec- 
tion it should be noted that it was necessary to use available 
variables which could be classified as situational or nonsituational 
instead of following the more desirable procedure of sampling a 
universe of situational and nonsituational variables. The study 
does, however, identify and cross validate certain variables which 


1 The opinions expressed are those of the writer and are not necessarily 


shared by the Department of the Navy. 
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differentiate to a Statistically significant extent navy trainees 
requiring assistance with psychological problems from those 
not requiring such assistance, 

For purposes of the study, situational variables were defined as 
those primarily in the present environment, while nonsituational 


. variables were defined as those essentially independent of the 
present environment. 


Procedure 


The study was conducted at a training center for naval aviation 
technicians having approximately 5500 navy and marine enlisted 
men under instruction throughout the period of the study. This 
period was from July 1953 through June 1954. 

As shown in Tables 1, 2, and 3, the seven situational variables 
consisted of the schools in which the students were enrolled and 
measures of success achieved by the students in the schools. The 
nine nonsituational variables consisted of aptitude test scores; 
overachievement-underachievement, in a previous navy school, 
educational level, age, service membership (i.e., Navy or Marine 
Corps), sex membership, petty officer status versus nonpetty officer 
status, and recruits versus men returned from the fleet for train- 
ing. 

The study is actually two almost indentical studies, one for the 
period of the last half of 1953, and the other for the first half © 
1954. It was decided in advance that the null hypothesis ir 
be rejected only in those instances in which differences were d 
cant at less than the .05 level in the case of both samples. 
might be argued that the same end could be achieved by an 
à More exacting confidence level for the rejection of the nu 
hypothesis, but this is not quite true. We know that condition" 
change to Some extent both within the schools and with nei 
to the students assigned to the schools over a period of time. s 
the present study, we were interested in whether the d 
that occurred over a period of several months would affect the 
significant differences observed in the initial sample, as well E 
in the stability or the initial findings from the standpoint of hee 
pling error, Selecting a more rigorous confidence level would satis!¥ 
nd of these two interests. ho 

The Psychological problems groups consisted of all men W 
were so classified by the medical staff at the naval dispensary 
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to which they reported, either on their own initiative or were sent 
by the school which they were attending. Cases which were quite 
serious were transferred immediately to a naval hospital near the 
training center. Less serious cases were treated at the dispensary 
as outpatients. Thus, the psychological problems groups included 
students having a wide range of problems and behavioral dis- 
orders. The total number in the group for the last half of 1953 
was 170, and for the first half of 1954, 162. This does not represent 
a decrease, since the population of the training center underwent a 
proportional decrease. 
It was considered likely that certain schools were contributing 
a disproportionate share to the psychological problems groups. 
This proved to be true as shown in Table 1. This information, 
supplemented by the fact that number of students, aptitude test 
scores, and failure rates vary from school to school made it ad- 
visable to stratify the two samples of normal students with which 
the psychological problems groups were compared. The stratifica- 
tion resulted in the normal groups containing the same proportion 
of students from each school that the psychological problems group 
for the same time period contained. Having arrived at the propor- 
tion of the normal group that would be drawn from each school, 
the appropriate number was drawn by putting the IBM cards of 
all men attending the school during the period in service number 
order and drawing every nth eard. If this stratification had not 
been accomplished it would have been difficult to state whether 
observed differences between the psychological problems groups 
and normal groups were due simply to school membership rather 
than to the variables per se. Two samples of 250 normal students 
were formed in this way, one from students attending the schools 
during the last half of 1953 and the other from students attending 
the schools during the first half of 1954. For purposes of the study, 
red to be normal if they had not been 


all students were conside r 
seen by the medical staff for psychological reasons. In the com- 
parisons in which it was appropriate to compute means, the critical 


ratio statistic was used. In those instances in which the comparison 
of numbers of subjects was appropriate, chi square was computed. 
Results and Discussion 


The results of the study are shown in Tables 1, 2, and 3. Tables 
2 and 3 contain all of the comparisons in which the chi-square 
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TABLE 1 
ScHooL MEMBERSHIP AND PSYCHOLOGICAL PROBLEMS 


1953 Sample 1954 Sample 


Aviation School 


Pm m Chi Square Ea E Che Suys 

** 

Engines 13 32.8 14.8** 14 27.2 T. 
Structures and Hy- 24 20.6 00.6 30 17.3 10.4 

draulics T 
Electronics 114 84.8 20.1** 100 81.0 8.9 
Training devices 11 10.2 00.1 10 9.2 0.1 


2 in 
* Expected frequencies are based upon the proportion of students 1 
each school to total students at the training center. 

** Significant at .01 level. 


statistic and critical ratio 
the exception of School me 
variables have been classi 
nonsituational. 
School membershi 


Statistic respectively were used, yon 
mbership. In both of these tables r 
fied as being primarily situational O 


D Which is viewed as a situational variable A 
considered in Table 1. In the case of both the 1953 and 1954 
samples, the aviation electronics school contributed a disp ~~ al 
tionately large number of students to the psychological proper 
groups, and the aviation engines school contributed a disprop im 
tionately low number. It is interesting to note that the SM 
course is more abstract and more difficult than the engines peres 
Furthermore, despite the assignment of students with hig er 
aptitude to electronics, the course is failed by a considerably larg 
Proportion of students than is the aviation engines course. ical 
Table 2 indicates that both the 1953 and 1954 psychologic 
problems groups contained a significantly larger pu 
academic failures than did the normal groups. From m ilure 
presented, however, one cannot be sure whether school blems 
tended to cause the higher incidence of psychological p s the 
or whether a tendency toward psychological problems cause tain 
observed higher failure rate. In addition to membership in ie OL 
Schools and academic failure, a difference, significant at t iable 
level, was found in the 1953 comparison on another caes 
classified as Situational. This variable was school grades, as h the 
in Table 3 The same comparison for 1954 did not ee it by 
standard set for Statistical significance but closely approache 


— 
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TABLE 2 


COMPARISON or PSYCHOLOGICAL PROBLEMS Group witn NORMAL GROUP 
on CERTAIN VARIABLES USING THE CHI SQUARE STATISTIC 


1953 Groups 1954 Groups 
Comparisons pic Normal Len fA | Normal 
roblems sets Problems Chi Square 
LJ e o e o e o e 
—M—Ó ee — 
Situational 
Academic failures| 29 | 19 | 18 | 28 | 9.90**) 24 | 17 20 | 27 | 4.79* 
Other students 141 |151 |232 |222 138 |145 |230 |223 
Academic set- | 84 | 79 |112 |117 | .87 | 72 | 76 122 |118 -75 
backs 
Other students 86 | 91 |138 |133 90 | 86 |128 |132 
Nonsituational 
Petty Officers 13 | 11 |15 |17| .44 | 12] 12] 18] 18 .01 
Nonrated men 157 |159 |235 |233 150 |150 |232 |232 
Recruits 135 |140 {210 |205 | 1.45 |110 |125 208 193 | 13.07** 
Fleet returnees 35 | 30 | 40 | 45 52 | 37 | 42 | 57 
Navy men 110 |123 |193 |180 | 7.86**| 96 |107 |177 166 | 5.86* 
Marines 60 | 47 | 57 | 70 66 | 55 | 73 | 84 
Male 164 |166 |245 |243 43 |152 |157 |247 |242 7.96** 
Female e| 7| 5| 4 10| 5| 3| 8 


* Significant at .05 level. 

** Significant at .01 level. 

fidence. It should be noted that ade- 
l grades were not available for the 
te the schools. Hence the comparison 
on the basis of school grades includes only 67 of the 170 students 
in the 1953 psychological problems group and 42 of the 167 students 


in the 1954 psychological problems group. 

The comparison involving the one remaining situational variable, 
academic setback, as shown in Table 2 did not yield a significant 
difference. The term “setback” refers to repeating a unit, usually 
two weeks in length, because of unsatisfactory progress in the unit. 

Considering now the variables classified as nonsituational and 
on which significant differences were found in both the 1953 and 
1954 samples, Table 2 indicates that the Marine Corps tended to 
contribute men to the psychological problems groups to a greater 
extent than did the Navy. Two additional variables classified as 


reaching the .06 level of con 
quate data concerning schoo 
students who failed to comple 
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TABLE 3 


Comparison OF PSYCHOLOGICAL PROBLEMS Group with NormaL Group 
on Certain VARIABLES USING THE CnrTICAL Ratio STATISTIC 


1953 Groups 1954 Groups 
" Psychologi- d Psychologi- d 
Gomparisous cal Problems| Normal cal Problems| Normal 
CR u * I CR 
Mean| SD |Mean| SD Mean| SD |Mean| SD 
Situational 
School grades 73.15) 5.00 | 75.25) 5.04 | 2.80** | 73.71] 5.41 | 75.47] 5.51 | 1.91 
Nonsituational 


Marine aptitude bat- | 55.76) 5.99 | 58.56] 5.85 | 2.43* | 55.14) 5.88 | 56.53) 6.96 | 1.25 
tery, composite 
score 

Navy aptitude bat- 57.16) 5.18 | 58.65) 4.71 | 2.46* | 54.76) 5.90 | 56.00) 6.65 | 1.57 
tery, composite 
score 


Over-underuchieve- 98,97) 5.23 |100.14| 4.91 | 2.00* | 98.9 | 6.11 |101.81| 5.72 4.59** 
ment index 

Civilinn education 11.21| 1.21 | 11.82| 1.27 | 4.77** | 10.88| 1.40 | 11.52 1.22 | 4.71** 

Age 20.92) 2.48 | 21.54) 2.50 | 2.42* | 20.94| 3.67 | 21.35 2.90 | 1.18 


* Significant at .05 level. 
** Significant at .01 level, 


nonsituational in Table 3 met the standard set for significance in 
both the 1953 and 1954 samples. The first of these was over- 
achievement-underachievement in a previous service school, 
the psychological problems groups tending toward underachieve- 
ment. The second variable was years of civilian education, the 
mean of the psychological problems groups being less than that of 
the normal groups. 

f The remaining nonsituational variables did not meet the condi- 
tion set for significance for the study. These included aptitude 
Scores, age, sex membership, petty officer versus nonpetty officer 
status and recruit versus fleet returnee. Tt should be noted in this 
connection that these results do not necessarily indicate that no 
differences exist on these variables. The conditions of the study 
were not sufficiently rigorous with respect to numbers and perhaps 
with respect to other considerations to justify the statement that 
any existing differences were given an adequate chance to be 
observed. It is believed, however, that one can have reasonable 
confidence in the Stability of the differences that satisfied the 
standard set for the study for the rejection of the null hypothesis. 
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Theoretical Considerations 


Empirical data frequently are better understood in the light of 
theory. Similarly, improvement of theory depends to a major 
extent upon its exposure to empirical data. One of the more highly 
developed theories of behavior is that of Hull (3). Brown and 
Farber (1) have added certain constructs to Hull’s system with a 
view to a more adequate handling of emotional behavior. Further, 
they have illustrated how their constructs fit into the larger system 
by considering phenomena commonly ascribed to frustration, in 
terms of their proposed method of conceptualization. The present 
study clearly does not lend itself to the rigorous procedures laid 
down by Hull for checking a segment of a theory and revising it in 


accordance with the results of the experiment. Nevertheless, it is 
system, as expanded by 


of interest to sce to what extent Hull's 
Brown and Farber, gives unity to the observed functional rela- 
tionships and facilitates understanding of them. Reference to 
Brown and Farber’s diagram summarizing the relationships 
between antecedent conditions, intervening variables, and conse- 
quent conditions (p. 492) will be helpful in the following discussion. 
This diagram has been reproduced by Lambert also (4, p. 66.). 
One of Brown and Farber's key explanatory concepts 1s frustra- 
tion. They use the term to connote a hypothetical state oF condition 
of the individual, as opposed to 2 frustrating event or external 


barrier. The term is viewed as an intervening variable between 
ariables which ultimately are 


Other construets or intervening V? š 
anchored both at the antecedent end and consequent or response 
end by observable conditions. In the present study, the antecedent 


conditions may be thought of as including the 16 — on 
which the nor : and the psychological problems group 
"Wm six of these antecedent condi- 


Were compared. ted previously, 

tions vu ETA be significantly related to ne consedit 
condition, that is, behavior which resulted in the individual being 
classified in the psychological problems group. — 

It is assumed that successful completion of the aviation schools 
Was an important intermediate goal or means to an end for virtually 
all students. The technical schools are à usual route to advance- 
ment and status in the various aviation occupations. Moreover, 
most, of the students had asked to be sent to the school in which 
they were enrolled in preference to some other aviation school. 
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Adapting the theory to the present data the intervening variable, 
“drive,” Do , is envisioned as a function of the desire to complete 
the school, and Dc as a function of some other desire or need. 
More than one of these competing drives, De , may have to be 
taken into account in a given situation. This concept of drive is a 
departure from Brown and Farber’s diagram on two counts. 
First, drive is described in terms of desired goals instead of depriva- 
tion or noxious stimulation. Second, in the Brown and Farber 
diagram, D without subscripts is used and defined as "relevant 
drive.” Hull has defined D as “strength of dominant primary drive 
operative in the primary motivation to action after the formation 
of the habit involved." The use of Do and Dg insures that the 
appropriate drives will be applied to the habits involved, that is, 
to habit strengths having the same subscripts as the drives. 

The intervening variables sHp, and sHp, represent habit strength 
of an ongoing response tendency directed toward achieving the 
goal of school completion, and habit strength of an ongoing response 
tendency directed toward achieving some other goal, respectively. 
When sHg, and sHro are multiplied by their respective drives 
two additional constructs emerge, sEro and sER, , which represent 
initial excitatory strength of the response tendency directed toward 
school completion and inital excitatory strength of the response 
tendency directed toward a goal other than school completion. 
The intervening variable “competitive inhibitory tendency,” Ir, 
is envisioned as a function of blocking and nonreinforcement of 
effort to achieve the goal of successful school completion. ‘“Frustra- 
tion,” F, the key intervening variable, results from the competi- 
tion or incompatability of sEro with both Ip and sEn, - 

At this point it may be well to see how well the six variables on 
which significant differences were found fit into the theoretical 
structure described thus far. Membership in the electronics school 
3S associated with high Ip values and membership in the engines 
enne Fin p Ir values since the electronies school has a 
a dei ij rate than does the engine school. Similarly, 
wl, mode era of normal students stratified to contain 
paychologieal nimi rom each school as were contained in the 
failure is signifie iig group, we confirm our notion that school 

antly related to membership in the psychological 
problems group, This would result. i p psycholog 
individoxis that " sult in a larger value of Ir in the 
uttimately made up the psychological problems 
p 
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group than in the normal group. These larger Ig values would more 
nearly equal the value of sEx, in the psychological problems group 
than in the normal group, a condition which, according to the 
theory, increases the value of F. The results concerning the ante- 
cedent conditions, educational level, and  overachievement- 
underachievement in a previous navy school can be explained in a 
similar way. It is reasonable that the behaviors which underlie 
both of these variables are related to success or failure in the 
aviation schools. The explanation of the final variable on which a 
significant difference was found, membership in the Marine Corps 
versus membership in the Navy, is seen in the larger value of F in 
the case of marines due to competition between the incompatible 
response tendencies sEro and gEg,. The present study provides 
no data on the subject, but there is considerable informal evidence 
to indicate that marines join the corps with the goal of becoming 
a member of a conventional marine combat unit. Many are not 
volunteers for aviation, as is true in the case of navy men. This 
results in a higher Dc value for marines, which, when multiplied 
by its appropriate habit strength, sHro , results in a larger value 
of sEro for marines than for navy men. Similarly, the conditions 
that result in a larger value of sEx, may also result in a somewhat 
smaller value of sEy, than that of navy men. For some individuals 
sEro may exceed sEro . The behavior resulting from this condition 
under different circumstances will be considered later. In any 
event, the competing values of sEn and sEx, would be expected 
to be more nearly equal in the case of the marine group than the 
navy group, and hence a larger value of F would be expected. 

Brown and Farber hold that F has two major effects. First, it 
increases the general level of motivation. This effect is integrated 
into Hull’s theory by considering F as an "irrelevant drive,” 
for which Hull has presented formulas. In the present adaptation 
of the theory “effective drive strength,” Do, results from the 
combination of the drive to complete the school, Do, and the 
"irrelevant drive due to frustration," D. When sHp, is multiplied 
by this new drive strength, Do , the construct, “final excitatory 
strength of the response tendency directed toward completion of 
the school,” &Eg,', emerges. D also combines with De, which 
through the same process as that just described, results i E b 
which is the “final excitatory strength of the response tendency 
directed toward a goal other than school completion.” 
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Adapting the theory to the present data the intervening variable, 
“drive,” Do , is envisioned as a function of the desire to complete 
the school, and Dc as a function of some other desire or need. 
More than one of these competing drives, De , may have to be 
taken into account in a given situation. This concept of drive is a 
departure from Brown and Farber’s diagram on two counts. 
First, drive is described in terms of desired goals instead of depriva- 
tion or noxious stimulation. Second, in the Brown and Farber 
diagram, D without subscripts is used and defined as “relevant 
drive." Hull has defined D as “strength of dominant primary drive 
operative in the primary motivation to action after the formation 
of the habit involved." The use of Do and Dg insures that the 
appropriate drives will be applied to the habits involved, that is, 
to habit strengths having the same subscripts as the drives. 

The intervening variables sHro and sHg represent habit strength 
of an ongoing response tendency directed toward achieving the 
goal of school completion, and habit strength of an ongoing response 
tendency directed toward achieving some other goal, respectively. 
When sHro and sHro are multiplied by their respective drives 
two additional constructs emerge, sEro and sErg , which represent 
initial excitatory strength of the response tendency directed toward 
school completion and inital excitatory strength of the response 
tendency directed toward a goal other than school completion. 
The intervening variable "competitive inhibitory tendency,” In ; 
is envisioned as a function of blocking and nonreinforcement of 
effort to achieve the goal of successful school completion. '*Frustra- 
tion," F, the key intervening variable, results from the competi- 
tion or incompatability of sEro with both Ig and sEro . 

At this point it may be well to see how well the six variables on 
Which significant differences were found fit into the theoretical 
Structure described thus far. Membership in the electronics school 
is associated with high I, values and membership in the engines 
schools with lower Ig values since the electronics school has 2 
much higher failure rate than does the engine school. Similarly, 
after drawing a sample of normal students stratified to contain 
the same Proportions from each school as were contained in the 
psy chological problems group, we confirm our notion that school 
failure is significantly related to membership in the psychological 
problems group. This would result in a larger value of Ip in the 
individuals that ultimately made up the psychological problems 
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group than in the normal group. These larger Ig values would more 
nearly equal the value of gEx, in the psychological problems group 
than in the normal group, a condition which, according to the 
theory, increases the value of F. The results concerning the ante- 
cedent conditions, educational level, and overachievement- 
underachievement in a previous navy school can be explained in a 
similar way. It is reasonable that the behaviors which underlie 
both of these variables are related to success or failure in the 
aviation schools. The explanation of the final variable on which a 
Significant difference was found, membership in the Marine Corps 
Versus membership in the Navy, is seen in the larger value of F in 
the case of marines due to competition between the incompatible 
response tendencies «Eg and sEnro - The present study provides 
no data on the subject, but there is considerable informal evidence 
to indicate that marines join the corps with the goal of becoming 
2 member of a conventional marine combat unit. Many are not 
volunteers for aviation, as is true in the case of navy men. This 
results in a higher De value for marines, which, when multiplied 
by its appropriate habit strength, sHx , results in a larger value 
of sExo for marines than for navy men. Similarly, the conditions 
that result in a larger value of sEr¢ may also result in à somewhat 
Smaller value of sEro than that of navy men. For some individuals 
six, may exceed sEro . The behavior resulting from this condition 
under different circumstances will be considered later. In any 
event, the competing values of gEno and sEnc would be expected 
to be more nearly equal in the case of the marine group than the 
navy group, and hence a larger value of F would be expected. 
. Brown and Farber hold that F has two major effects. First, it 
Increases the general level of motivation. This Lessa is integrated 
into Hull’s theory by considering F as an irrelevant drive, 
for which Hull has presented formulas. In the present adaptation 
of the theory “effective drive strength,” Do, results from the 
Combination of the drive to complete the school, Do , and the 
"irrelevant drive due to frustration,” D. When slg, is multiplied 
Y this new drive strength, Do , the construct, “final excitatory 
Strength of the response tendency directed toward completion of 
© school,” gEx,’, emerges. D also combines with Dc, which, 
, Tough the same process as that just described, results in sER,’, 
Which is the “final excitatory strength of the response tendency 
‘rected toward a goal other than school completion.” 
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The second major effect of F is to produce unique internal 
stimuli, Sp . These stimuli may have innately associated response 
tendencies, sUr , or they may acquire new associative connections, 
srHr, through processes of learning. These response tendencies 
are largely maladaptive and may bear little rational relationship 
to the objective toward which the thwarted drive was directed. 
These response tendencies, when multiplied by Sp , result in the 
final construct in the system, s, Eg , strength of excitatory tendency 
associated with frustration." 

At this point we have three final response tendencies, sBs, 
sER,’, and spEr . Assuming threshold strength, the response which 
will occur depends upon the relative strength of the three final 
response tendencies. If gs,’ has the greatest magnitude, behavior 
directed toward completion of the school will occur. This may 
take the form of additional study, seeking assistance from in- 
structors or classmates outside of class, and/or other adaptive 
behavior related to successful school completion. If Epo’ is the 
strongest of the three final response tendencies, the other two will be 
inhibited, and behavior directed toward a goal other than school 
completion will occur. In the case of marines who joined the corps 
with a desire to be a member of a conventional marine combat 

unit, it may take the form of behaving in a manner calculated to 
obtain a transfer to such a unit. If s, Én is greater than the other 
final response tendencies, the response will tend to be maladaptive. 
In a military organization, perhaps to a greater degree than in a 
civilian setting, this type of behavior is likely to result in a referral 
for psychological problems. Unlike Dollard et al. (2), Brown and 
Farber do not specify precisely which response, e.g., aggression, 
Withdrawal, or primitivization, will occur. The theory does include 
the expectations that the presence of frustration will often result 
in the intensification of response, that reactions followed by & 
reduction in frustration will be reinforced, and that responses 


associated with frustration in one situation will tend to appear iD 
other thwarting situations. 


Th MM : . : 
e above discussion obviously is no more than exploratory 1P 


dE ko Hull, Brown and Farber theory to the type of data 
uitis the ba the numerous limitations, including explanation 
2086 ths i = absence of quantification, it is interesting tO 
can be tandied cà ease with which the observed relationship? 

ed DY the theory. It raises the important question 9 
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whether the method employed in the present study could be 
adapted to the task of providing some of the empirical data 
required for detailed systematic development of the Brown and 
Farber theory of frustration. Perhaps this question could be an- 
swered by one or two studies in which the antecedent conditions 
were selected in accordance with the best information currently 
available, and measured as carefully as possible. The consequent 
conditions used in the study are already suited to the theory, 
since in its initial form it does not attempt to specify the precise 
reaction to frustration that will oceur. 


Summary 


Two groups of navy and marine trainees having psychological 
problems were compared with two groups of normal trainees on 
the basis of 16 variables. Significant differences between both 
psychological problems groups and their corresponding normal 
groups were found on six of the comparisons. Students of aviation 
electronics tended to have psychological problems in greater 
numbers than did other students. The reverse was true in the case 
of men studying aviation engines. Academic failure was associated 
with membership in the psychological problems groups. The 
three comparisons just mentioned were thought of as being 
primarily situational in nature. 

The remaining three comparisons on which significant differences 
were observed were classified as nonsituational variables. These 
were: overachievement-underachievement in a previous navy 
school, years of civilian education, and membership in the Navy 
versus the Marine Corps. Thus no consistent tendency for mal- 
adjustment to be more closely associated with situational variables 
or with nonsituational variables was found. 

The findings were discussed in terms of Hull’s theory, as adapted 


to emotional behavior by Brown and Farber. 
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COMPARATIVE EFFECTIVENESS OF 
TELEVISED AND LARGE CLASSES 
AND OF SMALL SECTIONS: 


LAURENCE SIEGEL AND F. G. MACOMBER 
Miami University, Ohio 


College enrollments for the next decade are expected to rise 
sharply with an anticipated shortage of well-qualified college 
teachers in most subject areas. These conditions will place addi- 
tional pressures upon publie institutions to increase class size. 
The development. of educational television, and particularly of 
relatively low-cost closed-circuit systems, raises the interesting 
possibility of teaching large groups of students while overcoming 
some of the visual difficulties inherent in large class instruction. 

Miami University is engaged in a research program directed 
toward the comparison of the relative effectiveness of large group 
instructional procedures with smaller conventional-type classes 
of approximately 30 to 35 students. Two of the general approaches 
under investigation are designated 
t visual contaet between student and 
ireuit television) classes. This paper 
summarizes the ained during the Spring Semester, 
1956. Three problem areas were investigated: 

1. Achievement as à funetion of assignment to experimental 
or control sections; 

2. Students’ attitu 


function of section assignment; . f , 
3. Students’ end-of-the-semester attitudes about having received 


TV or LC instruction rather than conventional instruction. 


to large group instruction 
LC (large classes with direc 
instructor) and TV (closed-c 
major results obt. 


des about the instruetor and course as a 


Procedure 


Subjects 


The first two problem 
comparison of results obt 
sections of the eight undergr 


areas under investigation required the 
ained from experimental and control 
aduate courses listed in Table 1. 


! This research was conducted under a grant to Miami University from 


the Fund for the Advancement of Education. 
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Each experimental section was taught either as a large class 
(primarily lecture) or by means of closed-circuit television. The 
control sections were taught by conventional procedures. Both 
experimental and control sections were taught by the same in- 
structor and no known biasing factor was known to be operative 
as a determinant of student registration for the experimental or 
control section. Nevertheless, the relatively small sizes of the 
control groups prohibited sole dependence upon random assign- 
ment of students to balance the effects of extraneous variables. 
Consequently, the experimental and control groups for each course 
were deliberately equated on the basis of scores on the Cooperative 
English Test (Form T), Cooperative Mathematics Achievement 
Test (Form P), and total score on the American Council on Educa- 
tion Examination (1948 edition), as well as grade-point-average 
for the previous semester. 

The matching procedure required that some of the students 
actually enrolled in the experimental and control sections of each 
course be excluded from the specific experimental and control 
groups under investigation. All decisions about the constituency 
of the experimental and control groups were made prior to the 
collection of research data. Neither the instructors nor the students 
were informed regarding the actual composition of the groups 
providing data for the Experimental Study in Instructional Pro- 
cedures. The total enrollments in each course and the numbers 
of subjects included in the study are summarized in Table 1. 

A full description of the matching data is given elsewhere (2). 
It is sufficient here to indicate that none of the mean differences 
between the experimental and control groups in any course on 
any of the matching variables was statistically significant. Equat- 
ing of the experimental and control groups on the critical variables 
of ACE and grade-point-average was further supported by the 
lack of statistical significance of the chi squares between pairs of 
Score distributions on these two variables. Furthermore, both 
E each of the eight courses were demonstrated to be 
propo ina a respect to proportional sex distributions and 

istributions by class standing. 


Teaching Procedures 


The speci ` : 
not a an mstructional procedures within each course were 
rdized. Rather, instructors were encouraged to develop 
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TABLE 1 


TOTAL ENROLLMENT AND NUMBERS or Exp 


ERIMENTAL 
SuniECTS IN EACH COURSE 


Number or Seco | Sey utes | Numer Sabie 
Course Title 
TV | LC |Conrol| TV | Lc | Com | ry | rc Con 
Foundations of Hu- 1 2» 103 74 | 97 62 
man Behavior 
Principles of Human | 1 1 96 31 90 30 
Physiology 
Introductory So- 1 44. | 141 144 | 131 129 
ciology 
Human Biology 1 1 183 | 178 183 | 177 
Business and Gov- 1 1 119 | 35 105 | 30 
ernment 
Composition and 2 2° 100 | 50 37] 37 
Literature 
Essentials of Modn. 3 3° 213 | 93 88 | 85 
Geography . 
Elementary Psy- 1 1 78 | 30 77 | 30 
chology 


^ Enrollments in multiple sections were divided approximately equally 
between each of the sections. ' 

b These are the numbers of students from each section actually pro- 
viding data for the Study. a 

° Multiple sections taught by the same instructor. 

a Multiple sections taught by different instructors, each of whom ro- 
tated TV presentations. . . 

* Multiple sections taught by different instructors, each of whom also 
taught an experimental section. 


those procedures which seemed best suited to the instructional 
Situation, and which were best calculated to achieve the objec- 
tives of the course. Thus there were considerable differences in the 
manner in which the four televised courses were condueted. The 
Foundations of Human Behavior class met for two weekly periods 
9f 90 minutes each; Sociology and Human Biology were convened 
for three weekly periods of 50 minutes each. The Students in 
the experimental Physiology section met for three 50-minute tele- 
Vised presentations and for a two-hour laboratory period each week. 

he nature of the televised presentations also differed in the four 
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courses, In both Biology and Physiology, the general method was 
that of lecture supplemented by numerous demonstrations and 
training aids. Five different members of the department participa- 
ted in the televised presentations of Sociology. These presentations 
were approximately 30 minutes in length with the balance of the 
50-minute period devoted to a discussion in each receiving room 
under the leadership of a regular staff member. In Foundations of 
Human Behavior, the presentations took a variety of forms, but 
with approximately one-third of each class period given over to 
discussion under the leadership of graduate assistants who served 
as monitors in each of the receiving rooms. 

Similarly, there was no standard form of procedure in the large 
classes not utilizing television as a medium of presentation. The 
LC sections of English followed essentially the same procedure as 
that designed for the control sections except that only half as 
many written themes were required, and, of course, individual 
student participation was considerably limited by the size of the 
experimental groups. In Geography, the students received two 
lectures a week with a third period devoted to small group dis- 
cussion. The remaining two LC courses were given as three 50- 
minute lectures per week. 

The control sections of all courses were conducted as discussion 


groups with lectures and demonstrations inserted by the instructor 
whenever he thought it desirable. 

This diversity of approach was believed desirable even though it 
prohibited direct comparisons between courses. Foreing the in- 
Structors to adhere to a specific and inflexible type of presentation 
in both their experimental and control sections would have pre- 
vented them from capitalizing upon the advantages inherent in 


the specific classroom situation and would have emphasized some 
of the deficiencies, 


Evaluative I nstruments 
wean sien of instr uments provided data for this series of in- 
t eons: achievement tests and attitudinal measures, These 
wo classes of in 


a struments ar scribed s 
Athieyamient? e described separately below. 


sequence of studi was operationally defined for the present 
course examinati ^ performance on the objective portions of 
final grades, I ups administered for the purpose of assigning 

: Tt is recognized that such examinations typically 
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measure only a single dimension of achievement, i.e., subject- 
matter knowledge. Thus the present data are 
important course objectives as “synthesis,” “problem solution” 
and "eritical thinking.” The possibility of leakage of information 
about the test was overcome by simultaneous administration of 
each examination to the experimental and control sections of cach 
course with one exception. (Simultaneous administration in Biology 
was impossible because the experimental and control sections were 
taught during successive semesters.) Estimates of reliability of 
each test (Kuder-Richardson Formula 21) ranged from .55 to .93 
with a median of .88. To the extent that the assumptions implicit 
in this computation were not met, the resultant values are probably 
underestimates. 

Four Thurstone-type scales were included in the attitude battery 
administered near the end of the semester to students in experi- 
mental and control sections. These scales yield quantifiable infor- 
mation about (a) students’ appraisal of the effectiveness of the 
instructor in presenting material and handling the class (the I 
scale); (b) students’ appraisal of the course content (the C scale); 
(c) students’ attitudes about having received televised rather 
than conventional instruction (the TV scale); and (d) students’ 
attitudes about having received large class instruction rather than 
conventional instruction (the LC scale). Each of these scales 
provided a nine-point continuum with 5.0 as the neutral position. 
Student judges were requested to rate the items included in a 
preliminary pool, and items were selected for inclusion in the final 
forms of these instruments on the basis of low Q value and the 
median of the distribution of judgments. 

Split-half reliabilities (corrected for length of test) based on re- 
Sponses of a sample of 300 respondents were .91 for the I scale and 
-92 for the C scale. The intercorrelation between these two attitude 
Scales was .52. The corrected split-half reliability for the TV scale 
Was .89 and for the LC scale was .92 in a sample of 100 students, 


Results 


Conclusions based on the results obtained from a single semes- 
ter’s instruction are subject to the influence of a variety of 
errors. Only four TV and four LC courses were investigated. 
Conclusions based on these data might not apply to other courses 
or even to the same content taught by other instructors, Also, 


hot germane to such 


sampling 
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TABLE 2 


COMPARATIVE ACHIEVEMENT IN EXPERIMENTAL AND CONTROL SECTIONS 


TV Phase LC Phase 
Course Achievement Course Achievement 
Section | —————— |! ratio| Section | —— ——— — |t ratio 
M | SD M | sD 
Foundations of | TV 123.76] 16.03] 0.25 | Business and LC 70.03| 8.58) 0.70 
Human Be- | Control |123.11| 15.05) Government Control | 71.28] 7.05 
havior 
Introductory TV 110.09| 20.91| 0.24 | Compositionand) LC 28.94) 5.59| 1.33 
Bociology Control |109.53| 16.90) Literature Control | 30.78] 4.74 
Physiology TV 208.92) 31.33] 0.66 | Essentials of LC 79.83} 10.59} 0.83 
Control |204.50| 20.44 Geography Control | 78.54] 9.56 
Human Biol- TV 253.97) 15.21| 3.00*| Elementary LC 109.45) 15.93) 1.77 
ogy Control |248.00| 15.76) Psychology Control |102.52| 21.77 


* Significant at better than the .001 level. 


results obtained from the present samples of students may not 
generalize to other samples enrolled in the same courses chiefly 
because of the relatively small size of some of the control groups. 


Achievement 


The distributions of achievement test scores in the experimental 
and control sections as well as appropriate £ ratios are summarized 
in Table 2. The null hypothesis is not refuted for any course 
except Biology, i.e., with the exception of this one course, compara- 


ble achievement resulted from experimental and control presenta- 
tions. 

Interpretation of the statistically significant finding in the case 
of Biology is confounded by the faet that the experimental and 
contro] sections of this course were given during successive semes- 
ters rather than simultaneously. Although a definitive interpreta- 
tion of the results obtained for Biology is impossible at the present 
time, it is apparent that neither TV nor LC presentations adversely 
affected achievement as defined by subject matter examinations. 
This finding for TV courses is in agreement with results reported 
earlier by the Pennsylvania State University (1). 

The absence, of differential achievement in experimental and 
ses qe qun he peii dient 

$ i cademie ability in interaction with 
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TABLE 3 
COMPARATIVE ACHIEVEMENT AS A DUAL FUNCTION or SECTION 
ASSIGNMENT AND LEVEL OF ABILITY 


High Ability Subgroup Low Ability Subgroup 
Course Section Achievement Test Achievement Test 
f-ratio 1 ratio 
M SD Ne M SD Na 
Foundations of | TV 120.28 | 29.74 43 0.34 115.81 | 16.06 43 1.37 
Human Be- Control 128.38 | 13.92 26 120.96 | 14.77 29 
havior 
Introductory So-| TV 117.80 | 13.13 55 0.97 103.04 | 14.67 46 0.12 
ciology Control 115.52 | 12.60 07 102.70 9.54 40 
Physiology TV 221.70 | 29.34 | 37 0.41 197.87 | 29.20 | 39 0.25 
Control | 217.73 | 19.27 11 195.76 | 27.28 17 
Human Biology TV 258.03 | 13.81 95 2.20° | 249.56 | 18.84 86 2.61* 
Control | 253.14 | 16.50 98 241.45 | 17.12 68 
Business and LC 70.86 8.06 49 0.81 69.02 6.93 51 0.31 
Government Control 72.86 8.14 14 69.80 8.80 15 
Composition and| LC 32.43 2.79 14 0.06 26.73 5.57 22 2.09> 
Literature Control 32.33 5.29 15 29.73 3.95 22 
Essentinls of LC 83.81 9.19 42 1.14 76.11 | 10.49 43 0.22 
Modn. Jeog- Control 81.59 8.32 41 75.63 9.80 44 
raphy 
Elementary Psy-| LC 113.02 | 13.98 44 1.03 104.53 | 17.15 32 1.75 
chology Control | 108.61 | 17.69 18 92.55 | 24.06 u 


^ Fluctuations from total N in the sample caused by unavailability of ACE scores or absence 
from final examination. 
bp < 05. &p < 02. 


Section assignment. In order to investigate the possible existence 
of such interaction, the students within each experimental and 
control group were dichotomized with respect to ability on the 
basis of percentile conversion of total ACE store. The cutting 
point for this dichotomization was the fiftieth percentile. Compara- 
tive achievement of the “high ability” subgroups in the experi- 
mental and control sections and of the “low ability” subgroups in 
these sections is summarized in Table 3. 

The data presented in Table 3 support the conclusion that level 
of ability does not interact with assignment to a TV section or its 
control counterpart asa dual determinant of achievement. A similar 
conclusion applies to all LC courses with the exception of Composi- 
tion and Literature, The Statistically significant difference for the 
“low ability” students in this one instance is the result of both a 
mean difference and a variance difference. It might have resulted 
from the unreliability of the final examination or from sampling 
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errors. If replicated with a more reliable criterion and larger sam- 
ples, it would indicate that this is one course wherein the less able 
students profit more from small class instruction than from LC 
presentations. The high ability students in this course, however, 
performed equally well under both classroom conditions. 


Students’ Attitudes About the Instructor and Course 


It is desirable to camouflage the identity of specific courses and 
instructors for presentation of attitudinal data. These data are 
presented for courses designated by a letter rather than a title. 
The order in which the courses are hereafter listed differs from the 
order of listing in the previous discussions. 

Group data summarizing the results obtained from administra- 
tion of the I scale and the C scale in the experimental and control 
sections are summarized in Table 5. In courses taught by multiple 
instructors, the mean scores on both scales were obtained by pool- 
ing the data across instructors. 

It is apparent from the data exhibited in Table 4 that TV instruc- 
tors were rated at least as favorably by students in their television 
sections as by students in their control sections. Two of the four 
LC instructors, however, received significantly less favorable 
ratings in the experimental section than in the control section. 

Referring again to Table 4, there is a pronounced tendency for 
students in the control sections to rate the course content more 
favorably than students in either the TV or LC sections, although 
the obtained difference is statistically significant in only three of 
the seven courses for which data were available. Apparently, 
instruction given simultaneously to large groups of students (either 
by TV or in large classes) is accomplished at some sacrifice of the 
students’ favorability to the course even though it does not affect 
their examination performance. Although the data are not ame- 
à w direct comparison of TV and LC instruction, they lead to 
p ora wee this deficiency may be more marked in the 

struction than it is for TV instruction. 


Attitudes About TV and LC Instruction 


Gro 
Table E [nr to the TV and LC scales are summarized in 
Kis e ois fiducial limits extending the mean allow for an 
"pretation of this value in terms of favorability or 
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TABLE 4 
Summary or LScanLE AND C-ScALE RATINGS IN EXPERIMENTAL 
AND CONTROL SECTIONS OF 


I Scale? € Scale 


Course Section — — - ü 
M SD t-ratio M SD t ratio 
A TV 2.80 1.00 — 3.08 0.50 — 
Control» — — — M 
B TY 4.56 1.74 1.35 4.50 1.24 2.27* 
Control 1.10 1.40 1.00 0.91 
G TV 1.16 0.88 2.86** 1.39 1.26 0.94 
Control 1.56 1.24 4.23 1.26 
D TV 2.94 1.02 0.11 3.43 0.88 0.94 
Control 3.14 1.00 3.28 0.85 
E LC 1.64 1.16 2.14* 3.91 1.07 3.18** 
Control 4.00 1.48 3.21 0.86 
F LC 3.58 1.06 2.67** 3.83 1.07 2.88** 
Control 3.10 1.16 3.37 0.89 
G LC 3.42 1.08 0.77 3.97 1.10 1.26 
Control 3.22 0.98 3.68 0.83 
H LC 3.60 0.80 0.00 3.93 0.95 1.90 
Control 3.62 0.40 3.50 0.95 


m 


a Scores below 5.0 denote ratings as ‘‘more effective than average;" the 
lower the mean score, the more favorable the rating. 

b Data not available. 

* Significant between the .05 and .01 levels. 

** Significant at better than the .01 level. 


TABLE 5 
Summary or TV anp LC Scare Responses 


TV Scale LC Scale 


Fiducial Fiduci 
Course M SD Limits* Caes M SB d 


4.91 0.83 +0.14 
6.19 0.81 +0.18 
5.42 0.99 +0.20 
4.76 0.94 +0.20 


5.74 0.72 40.16 
5.45 0.84 +0.19 
5.35 0.81 +£0.96 
6.00 — 0.64 4.0.13 


cane 
mosh 


* Fiducial limits of the mean calculated for the .05 level of confidence. 


unfavorability to TV and LC instruction (taking 5.0 as a neutral 
Position). 

Two conclusions follow from these data: 

1. It is impossible to generalize on students’ attitudes about TV 
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instruction. Students in one course preferred TV instruction to 
conventional instruction (Course D); those in another course 
considered the two types of presentation to be of approximately 
equal effectiveness (Course A). However, students in two of the 
TV courses (B and C) felt that television was inferior to conven- 
tional instruction. 

2. Students in all LC courses would have preferred assignment to 
a small section. The degree of this unfavorability, however, varied 
from slight (Courses F and G) to considerable (Course H). 

The apparent variability in TV and LC attitudes from course 
to course is indicative of the fact that factors other than section 
assignment play a role in conditioning students’ attitudes about 
the mode of instruction. The potential influence of two such 
factors (students’ ratings of the instructor and academic ability) 
was investigated. Correlations between both I scale responses and 
ACE total score with responses to the TV and LC scales are 
exhibited in Table 6. Note that significant correlations involving 
the ACE are indicative of an inverse relationship, i.e., the higher 
the ACE score, the less favorable the TV or LC attitude. 

It is apparent that if any ability subgroup is unfavorable to TV 
or LC instruction, it is more likely to be the high ability students 
than the low ability students. However, the inverse relationship 
between academic ability and attitudes toward the mode of pres- 
entation may be overcome by factors which are not yet entirely 
clear. One such mitigating factor is probably student perception 
about the instructor. There is a marked tendency for academic 


TABLE 6 
CORRELATIONS BETWEEN I-ScaLe Responses AND ACE TOTAL SCORE WITH 
ATTITUDE TOWARD EXPERIMENTAL TYPES OF INSTRUCTION 


TV Phase 


LC Phase 
Course I Scale ACE Course I Scale ACE 
i -25b -00 E 32» .06 
n -10 .92 F 13 .02 
5 -17 .23* G .10 .98* 
.365 .14 H 248 .04 


^ r exceeds 1.96 


ti 2 T 
B mezea dn Hye 1mes standard error of r of 0.00 (p < .05). 


times standard error of r of 0.00 (p « .01). 
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ability to correlate inversely with TV and LC attitudes in those 
courses where ratings of the instructor do not correlate with these 
attitudes. 


Conclusions 


1. Acquisition of subject-matter knowledge as measured by 
objectivity retest performance was not adversely affected by 
assignment to large (experimental) section (TV or LC) r 
than to a small (control) section. 

2. Academic ability did not interact with type of section assign- 
ment (experimental or control) as a complex determinant of 
achievement in any of the televised courses. A similar conclusion 
applies also to the LC courses under investigation with the possible 
exception of Composition and Literature. 

3. Student ratings of the instructor (I scale) were not adversely 
affected by television. The only statistically significant difference 
between ratings by the TV and control sections occurred in one 
course wherein the instructor was rated more favorably by his TV 
section. Ratings of the instructor did tend, however, to be less 
favorable in LC sections than in control sections. The mean 
difference was statistically significant in two of the experimental 
courses. 

4. Student ratings of the course (C scale) were less favorable in 
both TV and LC sections than in the control sections. This de- 
ficiency appears to be even more serious in LC sections than in TV 
sections (although data amenable to a direct comparison between 
these two types of presentation were not available). 

5. Students’ attitudes about TV instructi 
held across courses. In contrast, students i 
unfavorable to the mode of instruction wh 
class instruction. 

6. When academic ability was related at 
LC attitudes, the direction of this relatio: 
high ability students were less favorable th 
However, the fact that such a relationshi 
courses suggests the likelihood that oth 
academic ability in this regard. One s 
probably the students’ attitudes about 
instructor. 


ather 


on were not uniformly 
n all LC courses were 
€n compared to small 


all to students’ TV and 
nship was inverse, i.e., 
an low ability Students. 
P was not present in all 
er factors interact with 
uch interacting factor is 
the effectiveness of the 
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BOOK REVIEW 


D. B. Kurt. Mental Hygiene: A Survey of co. 
i and. Mental Health. New York: Henry Holt, MSB : en 
This is the first revised edition of a book which eran in E 

It follows the same general plan and seeks to follow "y aes e 

which considered ways of improving the teaching o po n n 
specially in emphasizing intellectual development rather ha 

aA interest. This book, therefore, is not merely a book on 

dac tieni but deals more fundamentally with principles 

* Sepe "o 1 and 2, discusses the nature and scope of 

Eos bs "d gives a summary of its history and pee 

art Two, Chapters 3 and 4, deals with the nature anid i inds o 

2 lis "der. Part. Three, Chapters 5 through 9, takes up the 

mental c res ardblem of the prevention of mental disorder, 

— Four, Chapters 10 through 18, on the promotion of mental 

nd “Ended discussions of the home and the personality; 

rocas problems from infancy through erpe A the 
lynamics of conscience; fixations; coping with reality; problems 

2 pression, conflicts, frustration, anxiety, emancipation, and 

a ae economie problems, security, pm and the aged; 

two popular forms of idolatry—grades and competitiv = e 

and the contribution of the right kind of religion to men ta ten $ 

“There are many ways to achieve personality integr ation, many 
ways to keep an even keel through the storms of joe? beta ate 
many individual differences, There are other and per haps epe) 
ways than the “hazardous trail of the go-getter. Recognition [n 
the proper values in life may help us to avoid the exaggerated 
devotion to dollar-chasing as the chief virtue of daily life. RU may 
help us to a healthier life than that of “relentless competition and 
power-seeking and self-aggrandizement.” 

Although the book is not written prim 
view of applied psychology, but rather fr 
principles, the reader ean find many prae 
parts of the book. Especially, he should be helped to a clearer 
understanding of psychology, particularly in its relation to mental 
health; and he can do so without being lost in technical language 
and unnecessary, complicated jargon, unverified assumptions, and 


383 


success— 


arily from the point of 
om that of fundamental 
tical applications in many 
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reified elements, that are prevalent today and often pass under the 
name of psychology. It is generally clear and simple in presenta- 

tion, and a glossary (pp. 627-642) adds distinctly to the usefulness d 
of the book. 

A few minor adverse criticisms seem relatively unimportant in | 
comparison with the general excellence of the book. It is sufficiently 
comprehensive for a semester college course and can be understood 
by the beginning student in psychology. It will stand easily with 
the few really excellent books in its field. A good index (pp. 643- ( 
654) concludes the book. 

A. S. Epwarps < 
The University of Georgia l 
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CONSISTENCY OF SOCIOMETRIC STATUS 


POSITION IN MALE AND FEMALE HIGH 
SCHOOL STUDENTS: 


Riva R. WrenTHEIMER 


Western Psychiatric Institute, University of Pittsburgh 


The present study represents an attempt to investigate the con- 
sistency of male and female sociometric statu: 

use of sociometric retests spaced at intery 
months, and 8 months. A separate evaluation of male sociometric 
status consistency and female status consistency for these same 
retest intervals is also included. The sociometric criterion situations 
employed are in the general area of leisure-time activities. Product- 
moment correlation coefficients. were calculated to evaluate the 
consistency of the total sociometric choice frequencies obtained 
in the several testings. 


Previous studies have shown tha 


8 position through the 
als of 12 months, 20 


t sociometric test scores have 
partieularly when the retest 
as part of an extensive Study of 


(tent mates in summer camps). Zeleny (5) reported, at the college 
——— s el 


ia, Delmas Miller, Principal of University Hi 
to the many teachers who cooperated in this project. = ER s 
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level using five choices and one criterion (membership in a discussion 
group), coefficients ranging from .93 to .95 on readministering à 
sociometric test on successive days. 

These research findings as a whole suggest that a high correla- 
tion coefficient obtained by the method of immediate retest is 
probably indieative of a memory factor, as well as the slowness 
with which people change in their choice behavior. 

With a longer retest interval, coefficients go down as would be 
expected on an a priori basis. Bronfenbrenner (1) in a study with 
the entire population of the University of Michigan laboratory 
experimental school (with a range of students from nursery school 
tothe sixth grade) found that rank correlation coefficients between 
raw status scores obtained by each student in the fall and spring 
sociometrie testing varied from .27 for the nursery school group 
to .59 for the sixth grade group. He concluded that in classroom 
situations marked shifts in sociometric status were comparatively 
rare and that, on the whole, children tended to retain the same gen- 
eral social position, with this tendency more pronounced in the 
older age groups. Jennings (2, p. 235) spaced 
at an interval of eight months' distance fr 
and found with her sample of delinquent 
of .72 for leisure-time companion choice 


a Sociometric retesting 

om the original testing 

girls a reliability coefficient 
Present Study 

Sample Characteristics 


-grade boys and girls from one 
s Used in the original sociometric 
testing in March 1953. Retesting of these adolescents occurred in 
ber 1954. Since sociometrie choices 
e embers of home room groups, the 
home room assi adhered to in this high school is 
were given home room assignments 
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TABLE 1 
SAMPLE CHARACTERISTICS AT THE T 
NuMBER oF Ca 


"uxE or Eacn Socromerric TESTING: 
Mean AGE, AND Grape LEVELS 


March 1953 March 1954 November 1954 


Characteristics Sociometric Test Sociometric Test — Sociometric Test 
Total Number 200 200 97 
Males 103 103 50 
Females 97 97 47 
Mean Ages (in yrs.) 16.3 17.3 18.5 
Number of Grade Levels 
Tenth 97 0 0 
Eleventh 103 97 0 
Twelfth 0 103 97 


the testing periods should exist. Table 1 summarizes some re 


levant 
data about the subjects. 


Sociometric Measuring Instrument 


The same near-sociometrie questionaire previously used by Wert- 
heimer (4) on a comparable high school group was employed. 
This near-sociometrie questionnaire had been constructed in terms 
of the age and experience of high school adolescents. lt was evident 
that the criteria on which choices were to be made had to be mean- 
ingful to them in terms of their own interests and experiences. The 
questions used were confined to roughly comparable social situa- 
tions. The attempt was also made to keep the questions on the 
same “level” of social rapport. For example, it was conceivable 
that an acceptable companion at a school party might not be the 
same one chosen for discussion of a serious personal problem Ad- 
ditionally, all of the questions were phrased positively so that the 
adolescents would not have their attention called to the less favored 
members of each group and, in this Way, Cause even more severe 
social maladjustment to these students. 


The sociometric questionnaire that was used included the f ollow- 
ing instructions to the students: 


What we are going to ask you to do t 
will not become a part of your school r 
answers to it. Anything that you inclu 
only be used as research material, 


oday is not a test o 
ecord. There ar 
de here will be 


f any kind, and 
e no right or wrong 
confidential and will 
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In response to the following questions, be sure and write down the three 
persons’ first and last names. You may put down the same name in response 
to more than one question, if you would really choose the same person. 


No additional specific instructions were given to the students. 
They were able to make companion choices from either sex in all 
three sociometrie situations. The three criterion situations for 
which three choices were to be given were: 


1. Suppose you have just won first prize on a quiz program. Among the 
prizes is an expense-free trip to Canada for yourself and three students in 
your home room group. Whom would you choose to go along with you? 

a, ————————————— b. 

Qi, ——————————————— 
2. Suppose you are planning a party at your home and have to limit the 
number of people you can ask to it. Which three members of your home room 
class would you invite? 

a. b. 

6. 
3. Suppose you have a chance to spend the summer at a camp which has 
a section for girls and a section for boys. Which three members of your home 
room group would you prefer as fellow campers? 

a. b. 

c. 


Administration of Questionnaire 


Since all of the home room classes in the tenth and eleventh 
grades of the high school (in the March 1953 testing) were requested 
to fill in the questionnaire, it was necessary that its administration 
be closely geared to the school schedule. In consultation with the 
school authorities, it was decided that it could be most expedi- 
tiously administered by the teachers themselves. Accordingly, the 
experimenter met with them as a group, commented on the ques- 
tionnaire procedure, possible sources of difficulty, etc. They were 
given the questionnaires in large envelopes that they were requested 
to seal in the presence of the students after administration and to 
return the material to the school office on the same day. With the 
exception of specific instructions to the students to limit their 
choices to their own home room groups, the teachers reported no 
difficulties in eonnection with the administration procedure. 

The same procedure and Sociometrie questionnaire were used in 
the March 1954 and November 1954 administrations. 
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Sociometric Questionnaire Analysis 

A roster of each home room class was obtained and a frequency 
tabulation made of the number of times each home room member 
was chosen by any other member as a companion for any or all of 
the criterion situations. Each person in the home room group ob- 
tained a total sociometrie choice frequency based on three choices 
for each of the three leisure-time activity situations suggested in 
the questionnaire. 


Treatment of the Data and Results 

To evaluate the consistency of these total sociometrie choice 
frequencies obtained by the designated students (both as a group 
and in terms of the male and female dichotomy) on subsequent 
sociometric testings, Pearson product-moment correlation coeffi- 
cients were caleulated between (a) March 1953 and March 1954 
data for the total group and for the males and females separately ; 
(b) March 1953 and November 1954 data for the entire group and 
for males and females separately; and (c) March 1954 and Novem- 
ber 1954 data for the total group and for males and females sep- 
arately. 

The obtained correlation coefficients are presented in Table 2. 

All of the nine coefficients are statistically significant beyond 
the .01 level and vary in size from .55 to .71. These data suggest 
that there is a high degree of consistency in sociometric status 
positon (as shown in expressed preferences for leisure-time com- 

TABLE 2 


Propuct MoMENT CORRELATIONS or TOTAL SocromETRIO FREQUENCIES 
or MALE AND FEMALE Hian Scuoor STUDENTS IN 
THREE Socromerric TESTINGS 


March 1953- March 1954- 
March 1953-March 1954 November 1954 November 1954 
Comparisons Comparisons Comparisons 
assas Imas 
Groups N r df N r df N r df 
Total 200 .55 198 97 .66 95 97 .68 95 
Male 50 .56 48 50 .69 48 50 .67 48 
Female AT .62 45 47 .62 45 47 .72 45 


Note.—All of these correlation coefficients are significant beyond the 
.01 level for the degrees of freedom indicated in each ease. 
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panions) in adolescent high school students as a group and that 
this consistency is maintained by both males and females. 

While this consistency in choice position may vary somewhat in 
different high schools (as, for example, in home room grouping 
systems that are more fluid than the system in the present study), 
it is likely that social status in leisure-time activities is well struc- 
tured in the tenth grade and is probably accentuated but at least 
maintained until the last year in high school. These data further 
suggest that a pronounced group development had occurred by 
the tenth grade in the sample used in this study. 


Summary 


The present study investigated the consistency of male and 
female adolescent sociometric status position through the use of 
sociometric retests spaced at intervals of 12 months, 20 months, 
and 8 months. A separate evaluation of male sociometrie Status 
consistency and female sociometrie status consistency for these 
same retest intervals was also included. The sociometrie criterion 
situations employed were in the general arca of leisure-time activi- 
ties. Product-moment correlation coefficients were calculated to 
evaluate the consistency of the total sociometrie choice f requencies 
obtained on the several testings, and all of these coefficients were 
statistically significant beyond the .01 level. The obtained results 
suggest that there is a high degree of consistency in sociometric 
status position (as shown in expressed preferences for leisure-time 
companions) in adolescent high school students as a group and 
that this consistency is maintained by both males and females. 
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AN EXPERIMENTAL STUDY OF THE EFFECT 
OF EXTERNAL DIRECTION DURING 
LEARNING ON TRANSFER AND 
RETENTION OF PRINCIPLES 


Jack E . KrrrELL 
University of California, Berkeley 


Itis not unusual for twoextremes to beassociated with educational 
praetice: (a) those teaching-learning situations containing such a 
minimum of structuring that they can be characterized as relatively 
independent discovery situations, and (b) those situations contain- 
ing complete presentations of selected, specific principles and facts 
and characterized as rote memory situations. Various research 
findings have indicated that, perhaps, amounts of direction during 
learning that lie between those two extremes may be of value in 
promoting maximum learning, transfer, and retention when used 
in classroom situations. The results of this investigation support 
suggestions that some direction during learning aids discovery, 
transfer, and retention of principles. 


'The Problem 


Experimental research has produced evidence that learner ac- 
tivity is of value in the discovery of principles. Classroom provi- 
sions for active pupil discovery have been found to be superior to 
methods emphasizing the specification of correct responses by the 
teacher in advance of discovery (1, 10, 13, 14). However, the ex- 
treme proposition that the more the freedom of learner activity 
and the less the amount of external direction, the better, as has 
been inferred in educational literature (11, p. 141; 15, p. 80) is not 
necessarily a valid one. Various experiments have demonstrated 
that some degree of direction in discovery is superior to none at 
all in promoting transfer (7, 8, 9). 

Three recent experiments somewhat similar in design and pur- 
pose to the one reported here have produced conflicting inferences. 
One experiment, using sixth-grade pupils as Ss, produced results 
favoring independent discovery as superior to specification of cor- 
rect response in advance of discovery activities (12). 
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Two other recent experiments used college graduates and college 
Students as Ss. The first of these two experiments found that of 
four different degrees of direction during discovery, ranging from 
a control group receiving no direction to a group supplied under- 
lying principles but no correct responses, the latter was superior 
in promoting transfer (4). The other experiment compared the 
effects of supplying principles as direction and independent dis- 
covery on transfer and retention (3). No significant differences 
were found in the discovery of new principles. After an interval of 
one month the group supplied with principles during discovery 
retained signifieantly more learned principles than the control 
group. 

'The results of these experiments suggest that direction, in the 
form of underlying principles, supplied during discovery is bene- 
ficial in promoting transfer to similar problems and to retention 
after one month. It would seem important to conduct further ex- 
periments with elementary school pupils in situations somewhat 
comparable to actual teaching-learning situations to obtain evi- 
dence of the relative effects of Minimum, Intermediate, and Maxi- 
mum amounts of direction during discovery on transfer to situa- 
tions of varying degrees of differentness and on retention. 

The hypotheses tested in the present experiment were that the 
amount of direction characterized as Intermediate (a) effects 
greater increases in learning and transfer to three situations of 
varying differentness, and (b) promotes greater retention through 
periods of two and four weeks. 


Materials and Procedures 
Subjects 


The Ss of this experiment, 132 sixth-grade pupils from public 
elementary schools in Pullman, Washington, were placed into 
three treatment groups through stratified-random selection. Strati- 
fication was based on high, medium, and low reading achievement 
classifications, 

Final membership in the Minimum treatment group numbered 
45, in the Intermediate treatment group, 44, and in the Maximum 
treatment group, 43. 

Treatment group means and standard deviations of S.R.A. 


Primary Mental Abilities intelligence quotients and chronological 
ages are shown in Table 1. 
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TABLE 1 


SRA INTELLIGENCE QUOTIENT AND CHRONOLOGICAL AGE MEANS AND 
STANDARD DEVIATIONS FoR THREE TREATMENT GROUPS 


Treatment Groups 


Statistic Minimum Intermediate Maximum 
Number of cases 45 44 43 
IQ mean 107.0 105.98 106.30 
Standard deviation 13.09 12.55 12.84 
C. A. mean (months) 131.26 131.68 132.33 
Standard deviation 4.55 4.48 5.06 


TABLE 2 


MEANS AND STANDARD DEVIATIONS OF NUMBER OF PRINCIPLES KNOWN 
ON PRETRAINING TEST FOR THREE TREATMENT GROUPS 


Treatment Groups 


Statistic Minimum Intermediate Maximum 
Mean 1.444 1.864 1.630 
Standard deviation 1.500 1.540 1.100 


The statistical difference among treatment-group IQ means 
were tested by referring the ratio of the variances among group 
means to the variance within groups to a table of the F distribution. 
The F of .063 was obtained indicating no significant differences 
among group means. 

The distribution of chronological ages could not be accepted as 
normal and a nonparametric test, analysis of variance by ranks, 
was applied.to test the significance of differences among the three 
treatment groups. The H value, 1.03, was found to be not signifi- 
cant. H, with the group numbers used in this experiment, approxi- 
mates a chi-square distribution. 

The mean number of principles known on the pretraining test 
and standard deviations for the three treatment groups are pre- 
sented in Table 2. 

The distribution of number of principles known on the pretrain- 
ing test could not be accepted as normal, and again analysis of 
variance by ranks was applied to test the significance of differences 
among the three treatment groups. A resulting H of .0526 was 
found not to be significant. 
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On the basis of the above analyses it was assumed that the three 
treatment groups did not differ significantly in initial mental 
ability or pretraining knowledge of the principles used in the ex- 
periment. 

Pretraining and Posliraining Tests 


'The pretraining and first posttraining tests for knowledge of 
principles consisted of 45 items grouped by threes for each of 15 
organizational principles. 

A second posttraining test, consisting of 45 items different from 
any used on other tests and grouped by threes for cach of the same 
15 principles, was used to measure transfer to new items based on 
previously experienced principles. 

A third posttraining test, consisting of 45 different items grouped 
by threes for each of 15 different organizational principles, was used 
to measure posttraining ability to discover and use unlearned prin- 
ciples. 

An example of the type of verbal problems used throughout 
this experiment follows: (a) Gone, (b) Start, (c) Go, (d) Stop, 
(e) Come. 

The five words of each item were selected so that four were 
related by some principle and one did not belong. The principle 
was the reason why one word did not belong and it became the 
learner's task to discover the principle and apply it in order to 
discover the word not belonging. In the example the principle 
was that of paired opposites. "Start" and "Stop" are direct 
opposites as are “Come” and “Go” but “Gone” is of the wrong 
tense to be a direct opposite of any of the other four and so does 
not belong. 

Twelve principles used in the pretraining, the training, and the 
posttraining tests were based on meaning relationships, one was 
based on sound relationships, one was based on familiar word 
combinations, and one was based on word arrangement and 
meaning relationships. If a wrong response was made for any 
item of a group of three, the S was given no credit for knowledge 
of the applicable principle. 
waa e distributed as booklets for group administration 
disini dien ied Instructions on the cover page. The directions 
underlyin ane ems were grouped and spaced on the basis of 

5 principles but gave no clue as to the nature of the 
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principles. The Ss were required to mark an X in the answer 
space corresponding to the word that did not belong with the 
other four in each item. 

Seventy-six of the 180 items used in this experiment and 10 of 
the 30 principles had been previously used in a recent, somewhat 
similar experiment (12). All of the items used were composed of 
words selected from the first thousand of each of two well-known 
reading word lists (5, 6). Such selection provided reasonable 
assurance that the items were not tests of reading ability for the 
sixth grade pupils serving as Ss. 

All three tests used for measurement in this experiment were 
administered to 87 sixth-grade pupils in another community prior 
to the initiation of this experiment. An alternate form procedure 
was used to determine reliability and after correcting for length 
of one test, a reliability coefficient of .74 was obtained. 

The third posttraining test composed of different items based 
on 15 different principles was compared with the first posttraining 
test for relative difficulty. The percentage of correct responses for 
each item was computed revealing a range from 6% to 72% with 
a mean of 35% for the first posttraining test and from 6% to 
79% with a mean of 23% for the third posttraining test. The 
same procedure was used on the basis of principles with a resulting 
mean of 8% for the pretraining test and a mean of 2.3% for the 
third posttraining test. Five educators acting as judges supported 
the accepted assumption that the third posttraining test was 
more difficult than the first posttraining test. 


Learning Materials and. Treatments 

Items used in the training tests were different but of the same 
type as those used for pre- and posttraining tests. The principles 
used for training were the same as those used in the pre-, first, 
ànd second posttraining tests. 

Three different combinations of clues to the principles deter- 
mining correct responses to the training items were incorporated 
into three sets of training materials, which when administered, 
each to a different group of Ss, effected different treatments by 
providing three different amounts of direction during learning. 
The procedure of administration was consistent throughout the 
training period and followed practices previously tested by re- 
Search. (3, 4). 
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The learning material for the group receiving Minimum direc- 
tion consisted of items arranged as in the pre- and posttraining 
tests. Three items based on the same principle were grouped 
together and separated from other groups of items by spacing. 
The Ss of the Minimum group were told that each group of three 
items was based on an underlying principle. No information was 
given this group as to the nature of the principles except that 
because of them one word in each item did not belong. 

'The group receiving Intermediate direction was provided with 
all the information supplied the Minimum group plus a verbal 
statement of the principle involved. The principles were printed 
immediately preceding the group of items to which they applied 
and were read orally twice with each administration of the items 
based on the principle. The stated principle for the item previously 
used as an example is: Form Two Pairs of Opposites. 

The materials used by the Maximum treatment group included 
all the clues provided the Intermediate group plus oral statements 
of the three correct responses for each group of items. 'The oral 
statements of correct responses were made for one group of items 
at a time prior to the marking of responses by the Ss. The oral 
statement made for the item previously used as an example was: 
"Start! is the opposite of ‘Stop’ and ‘Come’ is the opposite of 
‘Go’ so ‘Gone’ is the word that does not belong." 

Knowledge of correctness or incorrectness of responses was 
provided all Ss by the use of corrugated cardboard answer sheet 
backings, one surface layer of which was punctured to provide 
holes into which a pencil point dropped when the correct response 
for each item was marked. Such devices have been used success- 
fully with groups in other experiments (3, 4). Their use in the 
present experiment was accompanied by adequate proctoring 
and instructions which included a statement that Ss should con- 
tinue to mark answer spaces until the correct response was made, 
but that each incorrect response mark would lower the S's total 
Score and so no unnecessary marks should be made. 

The training period was five weeks in length with nine items 
based on three principles being presented and repeated twice 
each week for a total presentation of 45 items based on 15 prin- 
ue les. To discourage Ss’ attempts to rely on memory of correct 
ie arcas for the items presented a second and third time 

; the words in each item were varied in order for each 
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presentation on the basis of five different correct-response patterns. 
The same patterns were used in punching holes in the answer 
sheet backings. The five different patterns were used in rotating 
order. 

The pretraining test when administered the second time following 
training became the first posttraining test. The number of princi- 
ples actually learned and transferred was the number of principles 
known on the first posttraining test not known on the pretraining 
test. Treatment-group means and variances of the number of 
principles learned and transferred to the first posttraining test 
are presented in Table 3. 

Analysis of variance was used to test the significance of dif- 
ferences among the three treatment groups. The F, 4.73, was well 
within the .05 level of confidence. Accepting the hypothesis that 
significant differences existed among the three treatment groups, 
pairs of groups were compared through the use of a ¢ test. Table 4 
presents the differences between treatment-group means and the 


significance of ¢ for each. 


TABLE 3 


'TREATMENT-GROUP MEANS AND VARIANCES FOR NUMBERS OF PRINCIPLES 
LEARNED AND TRANSFERRED ON First PosTTRAINING Test 


Treatment Groups 


Statistic Minimum Intermediate Maximum 
Mean 2.555 3.660 3.744 
Variance 4.290 3.570 4.440 

TABLE 4 


SIGNIFICANCE or DIFFERENCES BETWEEN MEANS OF TREATMENT GROUPS 
ron PRINCIPLES LEARNED AND TRANSFERRED ON First 
PosrrRAINING TEST 


Difference Standard 


E f Level of 
Group Pair poca Difference t Significance 
Minimum and Intermedi- 1.106 .422 2.620 P.09 
ate 
Minimum and Maximum 1.189 .448 2.654 P.02 
Intermediate and Maxi- .084 .429 -196 Not signif- 
icant 


mum 
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Both the Intermediate and Maximum treatment groups were 
superior to the Minimum treatment group in learning and trans- 
ferring principles to previously encountered situations. "There 
was no significant differenee between Intermediate and Maximum 
treatment groups in this situation. 


Amount of Transfer to New Item Situations 


The number of principles learned and transferred to new item 
situations was the number of principles known on the second 
posttraining test not known on the pretraining test. The mean 
number of principles learned and transferred and variances for 
each treatment group as measured by the second posttraining 
test are presented in Table 5. 

An F of 18.87 was obtained through analysis of variance and 
found significant at the .001 level of confidence. The hypothesis 
that significant differences existed among the three treatment 
groups in the number of principles transferred to new item situa- 
tions was accepted. The ¢ test was applied to test significance of 
differences between pairs of treatment-group means. Table 6 


TABLE 5 
TREATMENT GROUP MEANS AND VARIANCES FOR NUMBERS OF PRINCIPLES 
LEARNED AND TRANSFERRED TO New ITEM SITUATIONS IN 
SeconDp POSTTRAINING TEST 


Treatment Groups 


Statistic Minimum Intermediate Maximum 
Mean 1.930 4.590 2.907 
Variance 3.520 4.160 5.190 

TABLE 6 


SIGNIFICANCE or DIFFERENCES BETWEEN MEANS or TREATMENT GROUPS 
FOR PRINCIPLES LEARNED AND TRANSFERRED TO SECOND 
PosTTRAINING TEST 


Difference Standard 
Between Error of Level of 
Group Pair Means Difference t Significance 
Minimum and Intermedi- 2.657 .415 6.40 P.001 
ate 
Minimum and Maximum .974 .444 2.19 P.05 
Maximum and Intermedi- 1.683 .458 3.68 P.001 


ate 
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presents differences between treatment-group means and the 
significance of ¢ for each. 

Inspection of Tables 5 and 6 reveals that the Intermediate 
treatment group was significantly superior to the other two treat- 
ment groups in the number of learned principles transferred to 
new item situations. The Maximum treatment group was superior 
to the Minimum treatment group in transferring principles to new 
item situations. 


Posttraining Discovery of New Principles 


The third posttraining test was composed of 45 items based on 
15 principles. Neither items nor principles had been previously 
encountered by the Ss during this experiment. The number of 
principles known on this test was determined by counting the 
number of principles for which S made three correct item re- 
sponses. Treatment-group means and variances of the number 
of principles known on the third posttraining test are presented 
in Table 7. 

The distribution of numbers of principles known on this test 
was considered questionable when the Chi-square Goodness of 
Fit test yielded a chi-square value of 15.03 found to be significant 
at the .02 level of confidence. Both a parametric technique, 
analysis of variance, and a nonparametric technique, analysis of 
variance by ranks, were applied to test the significance of differ- 
ences among the three treatment groups as to the number of new 
principles discovered. An F of 38 and an H of 30.88 were obtained, 
both significant at the .001 level of confidence. 

The t test and analysis of variance by ranks were both applied 
in testing for significance of differences between pairs of treatment, 
groups. As both techniques produced values at the same levels 
of significance only the results of the ¢ test are presented here. 


TABLE 7 
p VARIANCES FOR NUMBERS OF PRINCIPLES 


"TTagATMENT-GROUP MEANS AN 
P OSTTRAINING TEST 


Known on THIRD P 


Treatment Groups 


Statistic Minimum Intermediate Maximum 
1.733 4.590 2.674 
Mean : 
4.20 4.67 4.93 


Variance 
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TABLE 8 


SrGNIFICANCES oF DIFFERENCES BETWEEN TREATMENT Grour MEANS OF 
NUMBERS or PRINCIPLES KNOWN on THIRD PosrrRAINING Test 


Difference Standard 
] Between Error of Level of 
Group Pair Means Difference t Significance 
Minimum and Interme- 2.857 .446 6.41 P.001 
diate 
Minimum and Maximum .941 .4525 2.08 P.05 
Maximum and Interme- 1.916 .468 4.09 P,001 
diate 


Table 8 presents the differences between treatment-group means 
of the number of principles known on the third posttraining test 
and the significance of ¢ for each. 

The Intermediate treatment group was able to discover a 
significantly greater number of new principles than either of the 
other two treatment groups. The Maximum treatment group was 
able to discover a significantly greater number of new principles 
than the Minimum treatment group. 


Number of Principles Retained 


The pretraining test administered for a second time at the 
end of the training period as the first posttraining test was ad- 
ministered a third time two weeks following its second administra- 
tion and two weeks later it was administered for a fourth time to 
measure the number of principles retained for two and four weeks. 
The number of principles retained was determined by the number 
known on the first posttraining test and also known on the two- 
and four-week retention tests. 

'The numbers of principles retained by each treatment, group 
were compared establishing that the Intermediate group retained 
significantly more principles than the other two treatment groups 
for two and four weeks and the Maximum group significantly 
more than the Minimum group. However, more indicative of the 
p effects of the amounts of direction on retention were the 

Bebe the retention loss by each treatment group. 
tent Hon a ig by ranks for matched samples was used to 
A dig ance of retention losses. Table 9 presents the 

P Means of numbers of principles known on the 
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TABLE 9 
SIGNIFICANCE OF DIFFERENCES FOR EACH TREATMENT Group BETWEEN 
THE NUMBER OF PRINCIPLES KNOWN ON THE First PosTrRAINING 
TEST AND THE NUMBER REPEATED AFTER Two WEEKS 


Mean Number of Principles 


Origi " 
Treatment Group VEA Repeated x? Sint 
Minimum 2.555 1.60 15.22 P.001 
Intermediate 3.660 3.50 .82 P.40 
Maximum 3.744 2.86 20.03 P.001 


Note.—Analysis of variance by ranks, matched samples. 


TABLE 10 


SIGNIFICANCE or DIFFERENCES BETWEEN THE NUMBER OF PRINCIPLES 
RETAINED Two anD Four WEEKS ror EACH TREATMENT Group 


Mean Number of 
Principles Retained 


—— Level of 
Treatment Group Two Weeks Four Weeks x? Significance 
Minimum 1.60 1.177 5.60 P02 
Intermediate 3.50 3.295 1.15 P.30 
Maximum 2.86 2.300 11.12 P.001 


Note.—Analysis of variance by ranks, matched samples. 


first posttraining test, those retained after two weeks, x? values, 
and the level of significance for each. 

The Intermediate treatment group made no significant loss in 
retention of principles the first two weeks. Both the Maximum 
and Minimum treatment groups made significant losses in numbers 
of principles retained for two weeks. 

Table 10 presents the treatment-group means of numbers of 
principles known on the two-week retention tests and repeated 
on the four-week retention test, the x; values and the level of 
significance for each. 

Both the Maximum and Minimum treatment groups continued 
to make significant losses in retention of principles while the 
Intermediate treatment group still had made no significant loss 
in the number of principles retained. 

The significance of the losses of retention of principles by each 
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TABLE 11 
SIGNIFICANCE OF DIFFERENCES FOR Eacn TREATME 
NumBERS or PrincipLes KNowN on First Pos 
AND THE NUMBERS Rer 


Grove Br 
RAIN 
TED Arrer Four WEEKS 


WEEN 


Mean Number of Principles 


Originally " Level of 
Treatment Group Known Repeated xr Significance 
Minimum 2.555 1.177 20.00 P.001 
Intermediate 3.660 3.295 2.75 P.]0 
Maximum 3.744 2.320 25.84 P.001 


Note.—Analysis of variance by ranks, matched samples. 


treatment group for the total four-week retention period is 
presented in Table 11. 

'The Intermediate treatment group made no loss in retention of 
principles during the four-week retention period that was signifi- 
cant at the .05 level of confidence. Both the other two treatment 
groups made significant losses in the retention of principles during 
both of the two-week retention periods and for the total of four 
weeks. 


Discussion 


In the present experiment, sixth-grade pupils benefited from 
clues which, in addition to organization of materials and informa- 
tion that such organization existed, provided relationships on 
which to base discovery. 

Reasonable interpretation of all the evidence available indicates 
that Ss benefit from help given them in their search for bases 
determining correct responses. Specification of answers in advance, 
however, seems to encourage reliance on rote memory rather than 
discovering and applying on the basis of underlying relationships. 
Evidence provided by the present investigation seems to support 
such indications. 

The present investigation differed from previous experimentation 
como effects of Minimum, Intermediate, and Maxi- 
ind us. aa cones on transfer to three different situations 
miM ui. nol a within a single design. The 
discovery inert o irection, which provided as clues to 

s of underlying principles, proved to be superior 


in 
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to a Minimum amount of direction in promoting learning and 
transfer to all three types of situations. The Intermediate amount 
of direction was superior to Maximum amount of direction in 
promoting transfer to all but one type of situation. Two and four 
weeks after training, the group receiving Intermediate direction 
retained significantly more principles known in that situation 
than did the groups receiving Maximum or Minimum amounts 
of direction. The evidence provided by this experiment indicates 
that both the Minimum and Maximum amounts of directions 
during learning became less and less effective as the situations to 
which transfer was made became increasingly different and as 
the clapsed time between training and measurement of transfer 
became greater. The Intermediate amount of direction increased 
in effectiveness between the first and second transfer situations 
and maintained the new level of effectiveness on the third transfer 
situation. 

The scope of this experiment has acknowledged limitations. 
Should further research confirm the following implications, the 
teacher’s role in providing direction in learning-teaching situations 
would be clarified. 

Evidence provided by the present experiment suggests that: 

1. In addition to organizing the materials used in learning, 
teachers should aid pupil discovery by suggesting meaningful 
relationships on which learners may base discovery and by pro- 
viding practice with those relationships. 

2. Providing statements of underlying relationships without 
specifying answers fosters learning, retention, and transfer to 
different situations. 


Summary 


The purpose of this experiment was to determine the relative 
effects of three amounts of direction to learners in their discovery 
of established principles on transfer to differing situations and on 
retention of learned principles. 

Each of three groups of sixth-grade pupils was supplied a differ- 
ent amount of direction during discovery of principles determining 
solution of multiple-choice verbal items. The group receiving an 
Intermediate amount of direction consisting of organized materials, 
information of such organization, and statements of underlying 


- 
Ft 35 on 
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relationships, learned and transferred as many or more principles 
to three differing situations than the groups receiving less or 
more direction. After periods of two and four weeks subsequent to 
training, the group receiving an Intermediate amount of direction 
retained a greater proportion of learned principles than the other 
two groups. 

Evidence from this experiment in conjunction with that of 
similar experiments indicates that furnishing learners with informa- 
tion in the form of underlying principles promotes transfer and 
retention of learned principles and may provide the background 
enabling future discovery of new principles. 
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THE PERSONALITY OF JUNIOR HIGH AND 
HIGH SCHOOL STUDENTS ATTENDING A 
RESIDENTIAL SCHOOL FOR THE BLIND 


HERBERT M. GREENBERG, Lovise ALLISON, 
MILDRED FEWELL, AND CHARLES Ricu 


Texas Technological College, Lubbock 


The method by which the blind can best and most wofitably 
y 1 
be educated has been a subject for controversy for MANY Sassi, 
The impact on the personality of the blind child of the residential 
school which educates better than three-quarters of all blind 
children is widely debated in the field. Some work has been done 
regarding the intellectual effects (B), which in general has shown 
isfactory i ectual growth for blind students. Nothing, how- 
satisfactory intelle 1 growth for blind students. Nothi "p 
ever, has been done in the way of focusing on the school for the blind 
as un effecter of personality. It is the purpose of this study, through 
s » from a representative school for the blind, to 
use of a sample from p | : 
are the norms of this group on several personality traits 
wh eral norms of their sighted age mates. In other words, 
Hd adam is ly to investigate a hypothesis that 
it is the purpose of this study ) i 
lic herr i -esidential school for the blind will be: 
ts attending a residential s o! 
eiie : i dnm sclf-sufficient, more submissive, less self- 
confid ge " sociable, and more authoritarian than the 
confident, ab as soci 


norm for the sighted group for these traits. 


Procedure 


Subjects n ‘jenlly all 
One hundred and three students, constituting Lee ome 
those attending grades six through twelve in a ei ier iek of 
for the blind, were tested. This school has ro qu dis 
students. By far the largest number live at ihe schoo E 
approximate 914-month academic year and return e sie 
summer. A few whose families live in the town in which t E i 
is located return home daily or on weekends. Living at home }s 
greatly discouraged by the school administration. : 
The students represent a cross section of backgrounds and 
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reasonably accurate cross section of the socioeconomic groupings 
of the state. There is a wide range of academic ability, with the 
norm probably being very similar to that of the sighted popula- 
tion. Those few graduates who attend college are generally quite 
successful, though it should be noted that this number is quite 


small. 


Test Battery 

The Bernreuter Personality Inventory, the California F Seale, 
and a personal data form were administered to the students by 
administrators working simultaneously with four different groups. 

The Bernreuter Personality Inventory has been standardized 
on thousands of high school and college students and adults. 
Percentile norms have been established for each of these groups, 
and for males and females in each group. The traits measured by 
this instrument are: neurotic tendencies, (BIN); self sufficiency, 
(B28); introversion-extroversion, (B3I); dominance submission, 
(B4D); self confidence, (F1C); social ability, (F28). The B3I was 
not employed in the present study, due to its very high correlation 
with BIN and FIC. The Bernreuter consists of 125 items to which 
the respondent answers with a yes, no, or question mark. 

The California F Seale was constructed by Adorno and others, 
and has been standardized on several thousand students and adults. 
It consists of 29 statements to which the S responds by various 
shadings of agreement or disagreement (strongly apres TA T1 
TA mildly agree +1, strongly disagree —3, ~~ =~ ae y 
disagree — 1). An example of the type of items employed 1s “la- 
miliarity breeds contempt.” : 

A re dian, pac min been employed by the senior author 
in his doctoral dissertation (4). It was designed to take data 


. je t the establish- 
ertaini ackeround of the subject so that s 
oe possible. It contains such 


ment of comparison groups would be 0 E 
. a income of family, religious 


items as age, sex, approximate yearly 
persuasion, length of time at school, satisfaction with school, 
grades, ete. It is not in any way to be deemed a psychological 


lest of a standardized instrument. 


Methods of Collecting Data 
In order to eliminate so far as possible the administration as a 


variable, a standardized administration was used by all four. 
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A Me was prepared, which was followed exactly by each 
nei ough rules as to the type of questions that could bé 
E swered were made and followed. Only definitions of words could 
Le. alee aes e oe ip I of test items was periti 
> ) were able respon ed on typewri r i I 
ane while others employed A irri nr 
E. he respondents were told not to put their AES anywhere on 
> dee and were assured that their responses could not be traced 
in any way to them. Tt was felt that with this guarantee of s 
ity more candor might be hoped for. Midi 


Methods of Treating Data 


Means and standard deviations v 
e Em go Subgroup A—6th, ih, wai cem € "chic 
9th and 10th grades; and Subgroup C 11th ; a Subgroup B 
These groups were further broken down into hes 12th grades. 
the Bernreuter B scales due to the differential ri € eriala 4 SE 
of these scales on the inventory. The t ratios a 
tween means of each subgroup for each of the pula t. taken be- 
sults for each question of the personal data form were € " 
ate 


for each of the subgroups. 
Results 


Table 1 gives the mean percentile for the several subgroups f 
: : : or 

each of the traits measured. From this table it can be seen that 
the group exhibited high neurotic tendencies. The lowest subgroup 
had a mean score at the 66th percentile, while the remainder were 
at the 80th and above. It can also be seen that the group is quite 
dependent. One subgroup has a mean as low as the 15th percentile, 
while the most self-sufficient subgroup had its mean at the 34th 
percentile. 
See further be seen that this trend is carried out in the 
ids: oo seale, the groups lying on the submissive 
sienificard pate only trait besides the authoritarian m which 
In this case it nd Seton found between any of the subgroups. 
t ratios being 3.15, o rp the youngest and the oldest males, 

From the high diu above the 1 % level of significance. — 

be assumed that ore tic trend exhibited by the group, it might 
is clearly indicated b is a lack of self-confidence. This, in fact, 
y Table 1 which shows that no subgrouP 


—— ---SS-r —" —— — 
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TABLE 1 


RESULTS FOR THE SEVERAL TESTED Traits BERNREUTER 
SCALES EXPRESSED IN PERCENTILE FIGURES* 


Group B1-N B2-S Bi-D Fr-C F2-S F scale 
Male A 82 15 15 81 29 152 
Female A 66 22 41 81 29 152 
Male B 83 19 23 86 25 167 
Female B 83 17 37 86 25 167 
Male C 81 21 46 80 45 151 
Female C 80 34 54 80 45 151 


^ The first five traits on the above table are scales of the Bernreuter 
Personality Inventory taken from the manual of that inventory. The sixth 


trait refers to the California F scale. 

Bl1-N—Neurotie Tendencies; B2-S—Self Sufficiency; B4-D—Domi- 
nance-Submission; F1-C—Self Confidence; F2-S— Social Ability; F scale— 
Authoritarianism. 


has a mean below the 80th percentile, thus demonstrating a severe 
lack of self-confidence. 

Tt should be noted that the trait of sociability is the only one 
in which the group lies on the healthy side of the norm, The most 
unsocial subgroup having a mean percentile of 45, however, au- 
thoritarianism is carried out in the pattern set by most traits, 
the means ranging from 151 to 167 on a scale ranging from 29 
to 203, the high numbers representing high authoritarianism. It 
should be further noted that the 9th and 10th graders show a 
ntly more authoritarian score than the 6th, 7th, and 8th 


significa à 
the 1 ratios being 4.71 or again well above the 1% level of 


graders, 
significance. 

The most noticeable result obtained by the personal data form 
was the apparent widespread dissatisfaction with the school. 
In Subgroup A, 27 would choose this school again if having to 
choose again, while 17 would not; in Subgroup B, 13 would choose 
again while 17 would not; and in Subgroup C, 16 would choose 
again and 17 would not. Also, many who answered affirmatively 
did so because they felt that there was no alternative but expressed 
great hostility toward the school. The subgroups answered the 
questions quite consistently with one another. There were no 
differences in the responses to the other questions on the personal 
data form as great as the difference indicated in the above para 
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graph except, of course, for the obvious differences of age and 
length of attendance at the school. 


Conclusions 


It is concluded that there is poor mental health at this residential 
school for the blind, as in most of the traits tested the mean per- 
centile is far over on the unhealthy side of the 
firming the hypothesis. The researchers might pose two alternative 
causal possibilities. The personality difficulties might be due to 
the sensory loss itself, or to some peculiarity of the particular 
school. It should be noted in this connection that, in previous 
research (4), scores still decisively unhealthy but mum closely 
approaching the norm were found in other residential sifeels 
while scores clearly on the healthy side of the scale ihn found 
for the blind attending publie schools. Thus some wej} ~~ ht 
be lent to the second alternative. VIS ES 


scale, thus con- 
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A PROCEDURE FOR MANAGING TEACHING- 
LEARNING SITUATIONS IN EDUCATIONAL 
PSYCHOLOGY 


Ray H. Siursox 
University of Illinois 


Can the teacher of educational psychology seriously attempt 
to practice what is preached? Can the course in educational psy- 
chology, or any psychology course for that matter, give daily 
evidence to students that knowing psychological theory and prin- 
a desirable difference in the way teaching- 
learning situations are handled? In his own classes dare the teacher 
of psychology try to use what he knows in guiding the work of 
his own students? Most teachers of psychology have probably 
given some attention to questions such as these. The folowing 
description represents one attempt developed over a long period 
of years to answer these questions in the affirmative. It is not 
given as a model but rather as an approach which may be sug- 
gestive to others who are also concerned about possible ways of 
translating psychological theories and principles into practice in 
realistie teaching-learning situations. 


ciples really makes 


Course Objectives 

The general purpose of the course in educational psychology, 
as scen by the writer, is to prepare each student to do effective 
and satisfying professional work through the identification and 
application of psychological principles. Learning the attitudes 
and skills that will be needed for professional growth while teach- 
ing is emphasized; memorizing printed materials for test purposes 
is discouraged except as this can be shown to contribute to profes- 
sional growth. Since the students in the current educational psy- 
chology course expect to teach, the attempt will be made to: (a) 
focus attention on the improvement of teaching and learning, 
(b) provide for cooperative consideration by the teacher and the 
learners in the class of the various processes involved in improv- 
ing the teaching-learning situation, and (c) encourage the under- 
standing and frequent usc of varied methods of attacking pro- 
fessional problems in teaching-learning situations. 

411 


412 Ray H. Simpson 


Course Activities 


The writer’s purposes in the educational psychology course are 
revealed in the outline of the major course activities, which are 
discussed in this paper: (a) identifying some key problems in 
teaching-learning Situations, (b) selecting problems for initial 
study, (c) working toward problem solution, (d) keeping a student 
learning record, (e) self-evaluation, (f) finding and using re- 
sources, and (g) developing effective interpersonal relations. 

With the guidance of the instructor, each student considers a 
great number of teaching-learning problems or situations. For 
example, consideration is given to such questions as these: On 
what basis should we group in this class? Should the total class 
group be used for all purposes? What use, if any, should be made 
of small groups? For what purposes might small groups be 
formed? Should out-of-class study groups be encouraged? What 
criteria should be used in marking in order to promote desired 
learning? These and similar questions relating to situations both 
in and out of the class vary with respect to implied educational 
objectives and methods. Some goals and activities are those of 
individuals, some of small groups, and some are those common 
to the whole class. In this class much ribns is given to helping 
students develop sound goals and purposes which were not initially 


recognized. 


Identifying Problems oe 

In order to provide a sound basis for later ee a a 
each learner, during the first few days of usc pes mie (hod 2 
to identify and list a number of professional pro bine exerted 
to be of concern to him. Those who have ater a ae 
are encouraged to draw on this experience. Students ci 
ing experience may object that they do not know any pt ange 
problems. Actually, most students will have had a mini ia 
13 years in teaching-learning situations. Through instructor gui 
ance and some supervised observations of elementary and sec- 
ondary school classrooms, the writer has found that undergraduate 
students without teaching experience show considerable ability 
in identifying teaching problems. However, direct experiences are 
not relied on exclusively. , 

Students are encouraged to use many types of resources in 
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widening their perception of the important issues now facing 
educators and of the problems they are likely to meet as teachers. 
Recommended resources for problem identification include: direct 
observation of classrooms taught by other teachers, educational 
psychology texts, other printed materials—particularly in educa- 
tional and psychological magazines—other students and their dif- 
ficulties, and other teachers and their problems. The following are 
problems illustrative of those listed by students: 


How ean I more adequately adapt my instruction to the individual dif- 
ferences in classes in ninth-grade general science? (Origin: own experience 
as a student in general science class which was not challenging to me.) 

What techniques can a young and inexperienced administrator use to 
achers to modify their practices? (Origin: talking with 
a recent college graduate who was somewhat discouraged.) 

What are some of the guiding principles and procedures that can be 
used in working with slow learners in the classroom? (Origin: table of 
contents of an educational psychology text.) 

What is good discipline and how can I promote it in my classes? (Origin: 
Experience as a student in classes. Also difficulties of a teacher in classes 


observed while I was a college sophomore.) 


encourage older te: 


After the learners have compiled their lists, they are encouraged 
ther in small groups to select the problems they con- 


to get toge 
s this is done during class time 


sider most important. Sometime 
if it is difficult for groups to meet outside of class. The goals here 


are to help each learner expand the scope of his perceived prob- 
lems, to get the reaction of others, and to appreciate the kinds of 
problems which others see as important. It is believed that activ- 
ities will help the students in their approach to the professional 
problems they will later encounter. Conventionaily, of course, 
this is done, if it is done at all, almost entirely by the teacher, 


Selecting Problems for Initial Study 

After cach student has identified a number of professional 
problems which he might consider studying (usually between 15 
and 35), all are encouraged to help set up criteria which will aid 
in deciding which most deserve attention and study. The instructor 
usually suggests one or two questionable criteria and one or two 


better ones in order to get learners started, e.g.: 


Questionable: What is easiest to teach or learn without regard to impor- 
tance? What problem was studied first last year? 
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Better: What are the present and prospective needs of individual learners 
in the class? (It is hoped that the students will eventually encourage their 
pupils to consider similar criteria.) What kinds of needed psychological 
training or practice is each learner not likely to get in other phases of his 
educational preparation? 


After hearing an example or two of cach type, each student 
tries to list both desirable and questionable or overemphasized 
criteria. He is encouraged to give his own reactions, not to try to 
anticipate the instructor’s attitudes, Subsequently, the ideas are 
pooled, and a list is put on the blackboard which involves the 
combined judgments of the class members. Students are encour- 
aged to consider the signifieance of particular criteria in deciding, 
what problems it is best to study.! The criteria which are judged 
to be the most valid are then used in deciding which problems 
will be studied intensively. The process just described helps each 
student develop basic purposes for his study and tends to de- 
emphasize excessive emphasis on working merely for a mark. 

After the students have idendified a number of problems on the 
basis of the criteria previously listed, each, with instructor help, 
is encouraged to select two, three, or four problems and break 
each of them down into subproblems. An illustration of a problem 
with its subproblems that one student developed Initially is the 


following: 


é p a a acy the problem of individual 
ow can I meet with a greater degree of preguany d 
oen in my classes in ninth-grade ——— a.c 
1. What are my specific purposes in teaching the course and what are 
2. What are probable learner goals in taking 
possible ways of determining these? 
3. What plans should I consider m 
4. What are ways of determining a learner 
5. What are possible organizational setups. 
which might be used in developing the course 
7 ical str ; and weaknesses or 
6. What are psychological strengths anc natin 
various class procedures which other science teachers have pr p E AS 
7. What procedures tend to develop critical thinking and the "m 2 
of the scientific method to science problems in individuals of differing 
ability? | 
8. What are possible ways of evaluating course work and helping learners 
evaluate their work? 


aking before the classes meet? 

's science needs? . 
logical and/or psychological, 

? " " 

limitations of 


* For additional development of issues related to problem pn 
see: Ray H. Simpson, Improving Teaching-Learning Processes. New York: 
Longmans, Green, 1953, pp. 41-96. 
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9. What are ways of making group work more effective in promoting 


learning? 
10. What kinds of resources will we be likely to need and how should 


these be obtained? 


After the student has broken down two, three, or four problems 
into specifie questions or subproblems—each problem with its 
subproblems constituting a possible working outline—the teacher 
and other learners in the group offer suggestions for improving 
them. From this group of two, three, or four tentative outlines, 
the student is in a position to decide which problem with its sub- 
problems it would probably be best for him to study. After his 
decision has been made, the student may add, delete, restate, or 
reorder the subproblems. This refined statement of a major prob- 
lem and its subproblems constitutes his basic organization or 
psychological frame of reference for carly study. Several problems 
may be attacked by a student during the course of a semester. 


Working Toward Problem Solution 

After the learner has organized his subproblems into a tentative 
outline, he is in a position to start looking for solutions and to 
react critically to ideas which are gleaned from various resources. 
It is emphasized by the instructor that for any one educational 
problem there is no single “best answer" since the teacher and the 
learners differ significantly in their needs or abilities in different 
situations. For example, different approaches to discipline are 
usually called for in à rural neighborhood and in an urban slum 


district. 


An attempt also is made to help the learner see that there are 


usualy arguments for and against each proposed solution, and he 
should become familiar with these arguments if he is to discuss 
the issues involved intelligently. Important in the understanding 
xsideration of the psychological principles 
ach is based. A study of these principles 
arner build up à theoretical framework 
more valid judgments in other realistic 


of the arguments is à cor 
or theories upon which e 
and theories helps the le 
upon which he may make 
problem situations. 


Keeping a Student Learning Record 
In the class in which a specific attempt is made to encourage 
the development of more effective learning processes, it is im- 
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portant that a rather complete record be kept by the student 
of his learning activities and ideas. Such a comprehensive record 
is useful from several standpoints. First, it enables the student 
to clarify his plans and ideas and the information acquired by 
writing them down in his own words. Second, it gives the teacher 
and other students an opportunity to examine and offer sugges- 
tions about the progress of his work in the course. Third, it pro- 
vides an objective basis upon which, with the guidance of the 
teacher, he can evaluate the work that he has done to date and 
make plans for future activities, 

The students are usually encouraged to keep a record of the 
following sort. Each should have an orientation section which in- 
cludes a list of the problems which he has identified as important 
from his standpoint. This section should also include a breakdown 
of several problems which he has considered for initial study. 
In the orientation section of the learner's record there is also a 
list of the comments, criticisms, or suggestions which have been 
made by the instructor or by other students, The orientation 
section is usually followed by an outline of the main problem 
which he is attacking together with the subproblems under it. 

Each learner keeps a resource sheet on which he lists, usually 
by number, all resources which he has found useful in connection 
with the course. After completing the rias e the learner 
prepares a number of pages at the top of ma h " w hich he writes 

bproblem. As he finds possible solutions of his sub- 
c a > hese on the appropriate pages. The origin of 
problems, he lists these " d by a number which refers to 
each possible solution is indicated by i : 
> ‘ 2 bibliography or on the resource 
the loeation of the item in the b g 


sheet. ; i rom a 
Under each subproblem, after each FÉ 2 beret and 
specific source, the learner writes his own critica niis 
reactions to the ideas he has obtained from various ae ^ own 
primary purpose of encouraging learners to put down thei : 
ideas is to develop critical thinking by each student. The various 
parts in the record, which are labeled “own reactions, include: 
one’s own professional situation, relating theory to practice, re- 
lating practical suggestions to one or more theoretical considera- 
tions, pointing out weaknesses of proposals that have been given, 
Suggesting types of situations under which the proposals might 


— — 
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work, and types of situations under which the proposals probably 
would not work. An attempt is made to urge students to go beyond 
such superficial comments as, “That looks good,” or “I like that,” 
or “it doesn’t seem to me that there’s anything in that,” to ex- 
pressions of reasons and a critical analysis of why the source or 
student takes a particular point of view. 

Periodically, the learner’s record is examined by the instructor 
and by other students in the class. The paramount purposes of 
ons on the part of the instructor are to get a general 
hat each individual learner is doing, and to 
stions which will enable him to improve 
his work as he plans additional activities in the course. The peer 
evaluation provides helpful suggestions both to those examining 
a student record and to the student whose record is examined. The 
written evaluations and self-evaluations of records which students 
give are sometimes examined by the instructor. Through recip- 
rocal student evaluations of records, the learner gets guided 
practice in offering help to other students. . : 

Periodically, learners are encouraged to arrive at tentative con- 
clusions with respect to particular subproblems that have been 


rather thoroughly investigated. 


such examinati 
understanding of w 
give comments and sugge 


Self-Evaluation 

Self-evaluation is conceived as making judgments concerning 
present and past activities and achievements so that better plans 
may be made for future learning. Thus conceived, self-evaluation 
is encouraged from the beginning of the course when the learner 
starts to identify problems of significance to him. Evaluation is 
a continuous process throughout the course, and at frequent in- 
tervals the student is encouraged to look over what he has done 
and, in the light of such an analysis, to plan future activities 
more effectively. At times the instructor and other students give 
guidance to the individual learner. One type of help in self-evalua- 
tion is to aid learners in setting Up criteria which may be used 
in evaluating the individual record. , 

Below is a list of criteria developed by a subgroup in one class. 
The questions constituting the criteria can serve as one basis for 
Self, peer, and instructor appraisal of student learning as shown 


in the record. 
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1. Are comments analytical, and do they show critical thought? 

2. Have sufficient theoretical ideas from psychological resources been 
gathered? 

3. Has a good balance of theoretical and applied ideas been maintained? 

4. Have theoretical ideas been related to practice? 


5. Have the theories underlying suggested practices been sufficiently 
examined? 


6. Are there types of resources which should have been used more? 
If so, what are they? 


7. Is the setup of the record flexible? 

8. What do you think should have been done differently? 

9. What types of things do you think you should stress more in future 
study? In what ways are these likely to improve learning activities? 

10. Does it appear that your study, as you have developed it, is likely 
to improve your professional behavior? 


In the course, several assumptions about evaluation are made, 
such as the following: 


The primary purpose of evaluation should be to help the learner 
see where and how he can become more efficient in learning. 

Learner goals and purposes in evaluation should be examined 
frequently and appropriate modifications should be made. Mental 
health is one of the paramount goals of self-evaluation, In the 
broad sense, no individual or group is likely long to remain 
mentally healthy unless efficiency in self-evaluation is developed. 
The teacher’s own self-evaluation can constitute an important 
basis for teacher and learner improvement. Evaluation of learn- 
ing activities in process is frequently more rena in improving 
learning than evaluation of the end products Bi eni since 
the former is the real basis for the latter and eimi d ud oe 
where improvement can best take place. cera 1 siehe ci 

s à w 

carried on in the course should not stop when a final gre a 
been obtained. " , 

Each teacher and each learner should attempt, ipiius, M 
help, to set up his own principles for evaluation, since it usually 1s, 
or at least should be, his own responsibility. All concerned should 
have a share in deciding the purposes and procedures of the evalu- 
ation. The student’s share should increase as he gains additional 
facility. Autocratic evaluation basically presents an impossible 
job for the teacher, because of the number of students. Also, when 
attempted, it is likely to be uneven in quality, not only because of 
the magnitude of the job, but also because the learner himself 
with sufficient guidance is in the best position to evaluate his own 
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M Individualization in education demands democratic evalua- 
ion. 

Evaluation should not be thought of simply as the judgment 
of an end product, but rather as a continuing process which always 
goes on concomitantly with other phases of learning. It is a con- 
tinuous appraisal and reappraisal for continuous reorientation of 
goals and activities. Nor should self-evaluation for learning pur- 
poses be one which allows students without guidance to do what- 
ever they feel like doing. An important part of self-evaluation is 
helping the learner build up appropriate criteria to guide him in 
making various types of decisions. Evaluation should be made 
up of frequent analyses of past work, and should be oriented 


toward systematic plans for changes in future work.? 
P g 


Finding and Using Resources. 

Students are encouraged to examine different texts, source books, 
and other resources during the course. One principle to be kept 
in mind in getting and using resources is that those resources which 
are appropriate for one learner in a class may be most inappro- 
priate for another learner in the same class. Some factors contrib- 
uting to this principle are differences in needs, attitudes, skills, 
and basic abilities, and in what is considered usable material. The 
instructor has found that many students need much help in finding 
and using resources. Many students have been trained to depend 
almost exclusively on instructors and hence must be weaned gradu- 


ally. 


Purposes in using texts differ somewhat from those in conven- 


tional courses. Several texts are used rather than one. Each student 
uses different texts on à reciprocal basis with other students. One 
purpose is to help them develop the habit of referring to appro- 
priate sources in problem situations. Informal studies of what 
teachers typically do in a professional problem situation indicate 
that such a habit is not currently common. A second purpose is to 
aid learners in practicing the skills necessary in getting help from 
educational psychology texts. Extensive use is made of tables of 
contents, introductory paragraphs in chapters, chapter summaries, 
book indexes, and other key parts of books. A third purpose of use 
of texts is to give practice in critically examining the theoretical 


? For additional discussion and illustrations relating to self-evaluation see: 


Simpson, op. cit., pp. 173-229. 
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statements and practical suggestions in the texts. Initially, stu- 
dents may shy away from this, being loath to be critical of text- 
books which they have come to regard almost as sacred cows. An 
attempt is made to help the learner sce that through a critical anal- 
ysis of the thinking of others his own thinking is sharpened. 

Different texts are exchanged from time to time by individual 
learners and they attempt through use of the tables of contents, 
chapter introductions, chapter summaries, and indexes to use 
those parts of each book which seem to give the most theoretical 
or applied help on their particular problems. 

The resources used by a student should depend upon the prob- 
lems for study which he has selected, with the help of the teacher, 
and also should result partially from guidance given by the teacher 
during the course in the acquisition, selection, and use of resources. 
The instructor should have available a variety of bibliographies 


which are frequently used as “starters.”” Much utilized are Psycho- 
logical Abstracts and The Education Index. 


Students make considerable use of human resources, such as 
other students, teachers on the job, other instructors, and, of 
course, the instructor himself. They are also encouraged to use 
many other resources such as observations of classrooms in the 
public schools, and visits to youth groups. An annotated list of 
readily available films is given to each student and groups of 
learners are encouraged to view some films in the visual aids labo- 
ratory. Some of these films that small groups of students recom- 
mend highly are shown to the whole class at a later date. : 

Furthermore, they are encouraged to get practice in evaluating 
the ideas obtained from various resources and always we Pa 
giving critical reactions to these ideas. Some Fiber bm a : 
that, while the objective of such work is desirable, stu E 3 s i 
capable of doing it. It is here maintained that compe enge 
veloped through guided practice in these activities. 


Developing Effective Interpersonal Relations 


One of the critical points in our culture seems to relate directly 
to interpersonal relations weaknesses. We believe in democracy 
and in some respects we are democratic. However, many school 
situations are characterized by essentially autocratic practices. A 
functional educational psychology must face the problems tied up 
with the question: Do educational and social psychological re- 
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searches indicate the need for a study and use of democratic inter- 
personal relations? Most of the evidence seems to suggest an af- 
firmative answer. Therefore, since the course is considered a 
teaching-learning laboratory, attention is given to ways of study- 
ing and using democratic interpersonal relations. In most cases 
those who are affected by a particular decision should have an in- 
creasingly large share in making the judgments which lead to that 
decision. 

Beginning with the first class meeting, an attempt is made to 
get participation by all students in planning and managing the 
work of the class. As the course progresses, the instructor attempts 
gradually to shift more and more social control and responsibility 
for learning activities from himself to the students. While early 
assignments are made largely by the instructor, these are of such a 
nature that the individual learner can adapt them to his needs 
and abilities. For example, an early assignment, in brief, might be: 

List specifie professional problems related to teaching-learning situations 

which you have encountered as a student or teacher. Supplement this list 
by studying professional periodicals which take up problems in your field 
(for example, English teachers or prospective English teachers might use 
The English Journal). After each problem you list, indicate the Situation or 
specifie resource which suggested the problem to you. 
In one sense all students have the same assignment; however, it 
is set up in such a way that each learner can adapt it to his back- 
ground, interests, and needs. If students had had a more desirable 
type of training in individual responsibility for learning activities, 
it is quite possible that a more rapid start could be made in giving 
them primary responsibility for setting up their own learning ac- 
tivities. However, in view of the fact that learners most often 
come to the class expecting to be told what to do, when to do it, 
and how to do it, it is necessary, in line with principles of readiness, 
to work with the learner where he is and get him started off in 
assuming responsibility gradually. : 

The following are some of the approaches which have been 
found useful in establishing effective communication between the 
students and the instructor and among the different class mem- 
bers: An advisory committee from the class may help the in- 
structor plan the framework for in-class and out-of-class activi- 
ties, This group may be selected in a variety of ways, any one of 
which may be the best in à particular situation. Sometimes the 
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advisory group is made up of volunteers. In other classes it has 
seemed advisable for the instructor to select representatives from 
various subject-matter or teaching-level groups if the class be- 
lieves that would be most desirable. In some classes, where small 
groups within the class are formed carly and students have be- 
come acquainted, it is possible to have each group select a repre- 
sentative who serves on the advisory committce for a definite 
length of time. In other classes, where students have known each 
other, or each student has known several others, it is sometimes 
desirable to elect representatives. When nominations and elections 
are used, such questions as these are sometimes raised: What kinds 
of students should we try to elect? For example, should only the 
best students be elected or should we elect some who are weak (as 
determined by self-evaluation, teacher evaluation, or other-person 
evaluation) in this type of activity so that they can get practice? 
Students usually decide it is wise to have some of both types. 
Typically, it is probably desirable to discuss the method of choos- 
ing the advisory group so that the students may get some expe- 
rience in this kind of learning activity. 

In large classes, frequent conferences between the instructor and 
individual students are not possible. As a substitute, a written 
communication system can be introduced. On a cover sheet of 
the learner’s record, he can note down his problems and difficulties. 
The teacher can collect a certain number of learner records with 
attached comment shects cach day or each week. Between class 
periods the instructor examines them and returns the records with 
his comments on the cover sheet at the next class period. These 
may note learning strengths shown by the record, weaknesses in 
organization, or other criticisms, and may suggest additional ae 
proaches or resources, This arrangement permits the teacher to 
look over the learning record of each student, see what he is doing, 
and offer constructive suggestions to the learner. It also permits 
the student to indicate to the instructor what he believes to be his 
primary difficulties. In addition to regularly scheduled inspections, 
it is debatable whether it may sometimes be desirable to have un- 
scheduled inspections by the teacher. They permit the teacher to 
see work in process rather than a record “fancied up" for the 
HIM and so encourage the student to get help on a voluntary 

asis. 


Time is occasionally given in helping learners develop in the 
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various roles that a mature group member uses at various times 
in classes and small groups. Roles include such ones as are involved 
in giving leadership to an unstructured group. In some classes a 
bibliography which deals with individual roles in a group and on 
problems of “growing a group” is given to students. Individual 
members study various items of the bibliography and also look up 
other references of a similar sort and then report back either to 
the class or to small groups in which they may be working. 

It must be recognized that the teacher is in a key position as 

major class leader for guiding group activities toward democratic 
values and sound procedures. The democratie teacher is able to 
take upon himself a great variety of roles in the group. In any par- 
ticular situation, he attempts to assume the role which promises 
to be of most value to learners. This will sometimes be a good 
listening role. To promote a situation in which learners can prac- 
tice different healthy interpersonal roles, it is necessary for the 
teacher to depart from some roles traditionally played and assume 
many different roles at different times. For example, the teacher 
probably should not be thought of as knowing all the answers to 
the realistic problems that can come up in the course of discussion. 
The teacher's job, rather, is one of knowing a lot of answers and 
also of knowing how to help learners develop ways of assuming the 
responsibility for getting and analyzing possible answers them- 
selves. 

Typically, the learner's concept of his responsibility for class 
operation needs to be changed. The idea is current that the teacher 
is almost solely responsible for the whole class and for individual 
and small group success. Added learner responsibility would seem 
to be essential for significant improvement in large and small 
group activities as well as in individual learning. 

One aspect of interstudent communication is the setting up of 
groups. These groups may be set up for different purposes. Some 
of the purposes which have been found significant are: 


To answer each other’s questions with respect to the operation of the 
course and the development of individual learning activities. 

To offer critical and constructive suggestions on the learning record of 
other members of the group. 

To develop, as a group, various aspects of a particular problem. For 
example, students in one group might be working on the question: What 
are some of the ways of diagnosing disciplinary problems and improving 
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them? One student might work on the matter of improving assignment mak- 
ing, another on the ways of getting and using appropriate resources for 
class activities, and another on the issue of getting democratie learner par- 
ticipation in class activities. 


Some consideration has already been given to evaluation. It is 
important in this connection that both the teacher and the learner 
give consideration to ways of evaluating their individual behav- 
iors in interpersonal situations and in planning improvement in 
such behaviors. 

Class cooperation is sought not only in developing plans for in- 
class and out-of-class activities, but also in setting up evaluative 
criteria which will be used in promoting improvement in the class, 
and as a basis for marking where marking is either required by 
the system in which the class is operating or is felt desirable by 
both the teacher and students. When marking becomes a dual re- 
sponsibility of the teacher and learners, it is more likely that an 
attempt will be made to align the basis of marking with the types 
of activities which will promote profitable learning. This means 
that the teacher and students must together examine and re- 
examine the primary purposes of the course and the degree to 
which these purposes are being achieved. 

In the light of such purposes, the teacher and the students can 
cooperatively set up criteria on which the marks will be based. 
Such criteria may differ quite significantly from one course to 
another, depending on its purposes. For example, if it were a course 
in appreciation of literature, the primary criterion on which mark- 
ing would be based might be: Has there been improvement in the 
appreciation of literature defined in behavioral sie Mere 
semester or year and, if so, to what degree? On the ot ES i (eR 
it is a professional course such as educational psycho ogy Wi ^u 
psychology of learning for teachers, a major criterion might be: 
To what level of professional development has this learner grown 
by the end of the period of study—if the mark is designed to indi- 
cate level of professional competency. 


Summary 


The following points represent what the writer believes to be 
crucial aspects of the course which has just been described. 

Continually, attempts are made by the instructor to have each 
member of the class voluntarily bring his intelligence to bear upon 
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the total problem of utilizing psychological resources in the iden- 
tification and the solution of significant professional problems— 
significant particularly to the individual learner. Generally this 
process involves setting up goals and purposes, deciding on courses 
of actions, and attempting a continuous evaluation of the work 
as it proceeds. 

The course for an individual learner is not rigidly set up in ad- 
vance. In effect, the learner produces a custom-tailored set of 
learning experiences with the guidance of the teacher. 

Goals, study procedures, class procedures, and other factors in 
the management of teaching-learning situations are kept flexible 
enough to give considerable attention to the development of learner 
judgment. 

Learning for the student consists of a combination of individual, 
small-group, and whole-class activities which are planned by those 
most concerned. 

The educational psychology course is, itself, used as a teaching- 
learning laboratory where informal tryouts of some proposed so- 
lutions to educational problems can be made. 
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SEX DIFFERENCES AMONG TEACHERS IN 
THE USE OF THE CRITERIA OF 
ADJUSTMENT’ 


Harry Berwin AND Emmy WERNER 
University of Minnesota 


The criteria employed by teachers in evaluating behavior prob- 
lems of children has been a matter of much concern since the clas- 
sic study by Wickman reported in 1928 (25). In that study, the 
attitudes of teachers and “mental hygicnists” were contrasted in 
regard to a list of children’s undesirable traits and behavior. The 
results indicated greater teacher emphasis upon classroom man- 
agement, learning, and authority problems; clinicians placing 
greater emphasis upon withdrawing and other "unsocial" per- 
sonality characteristics. 

The report of these findings is regarded as having had a pro- 
found effect upon teacher education programs (6) with directly 
measurable changes in teachers’ attitudes evident as early as five 
years after publication of the study (1). To say the Wickman 
study itself was responsible for the reversal of emphasis in teacher 
education would not do justice to the tenor of the times which the 
spirit of the report reflected: namely, the growing awareness among 
many groups of the psychological approach to the understanding 
of human behavior. This may account for the astonishment of 
some observers (e.g., 6) that the Wickman study should have been 
so widely quoted and uncritically accepted when it embodied some 


basic flaws. 

The limitations of the investig 
ing three general problems: methodology, 
terion, and the consistency of behavior related to 
problem of methodology is not independent of the others. The 
criticisms are in essence these: 

1. Methodology. The directions given teachers and clinicians 


ation have been noted as involv- 
the adequacy of the cri- 
adjustment. The 


PR 
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were different in the time to respond and the emphasis upon 
present seriousness of behavior as against seriousness for future 
adjustment. The differences in directions should affect the com- 
parability of results for both groups. 

Replications of the study and related investigations have em- 
ployed the same directions for teacher and clinician (or other) 
groups, using the original directions given clinicians (7, 12, 20) 
or those to both groups (4, 18, 20). There have been replications 
(e.g., 25 years later) retaining the original design (20) as well as 
introducing a variety of modifications (13) including changes in 
the time-scriousness orientation (7, 9). The results, in the main, 
indicate that the directions given S’s do account to some extent for 
the differences obtained between groups, but disparities still 
emerge. With the passage of time, however, there has been a ten- 
dency for teachers’ criteria to more closely approximate those of 
clinicians. 

2. The criterion problem. The second issue concerns the choice 
of clinicians as a group against which to refer teachers’ attitudes 
toward children. Watson (23) has pointed out that there is no 
reason to suppose clinicians to be “correct” and teachers not, rather 
than vice versa. Wickman is criticized for not even considering 
the possibility. Other investigators (6, 10, 19) have dealt with 
this issue although not with the attention it deserves. It has been 
our thesis (2) that the criteria of adjustment are determined, at 
least in part, by the status and role of the person who evaluates 
the adjustment of others, and holding to a set of criteria is not 
inherently more correct for one group than another. The adequacy 
of criteria (of adjustment) will depend upon some independent 
standards related to the values and goals of those making use of 
the criteria. Since teachers may have very different goals from 
clinicians regarding the behavior of children, it may not be ap- 
Propriate to impose the clinicians’ goals upon the teacher. . 

3. Continuity of behavior related to adjustment. The last major 
limitation of the original study was its too ready acceptance of a 
direct relationship of withdrawing behavior in childhood to mal- 
adjustment in adulthood (14, 23, 25). No adequate data were 
available to support this assumption in 1928, and one may venture 
to say no definitive data are available now to substantiate it. As 
we have observed elsewhere (3), a too ready acceptance of this 
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assumption has tended to hamper success in studies of the predic- 
tion of adjustment from childhood to adulthood. 

In the present study, we are concerned with the problem of the 
criterion. In exploring further the relation of role and status to the 
criteria of adjustment, we are focusing on the effect of sex roles. 
It has been suggested by Davis and McGinnis (6) that the differ- 
ences between teachers and clinicians may have been due to sex 
differences (since in 1928 teachers were more likely to have been 
women; clinicians more likely to have been men”). 


Procedure 


In the present investigation, verbal descriptions from men and 
women teachers of the behavior of adolescents judged by them 
to be well or poorly adjusted were obtained. This procedure differs 
from Wickman’s in that teachers are not provided with a set of 
behavior descriptions but offer their own. The present approach 
or modifications of it have been employed by others in investigat- 
ing teacher attitudes (5, 13, 16, 17, 26). The present study dif- 
fers from most all those cited in that the same amount of emphasis 
is placed upon the criteria of adjustment as on maladjustment. 

Thirty-nine teachers (17 women, 22 men) of the high school of 
a Minnesota community of 8,000 were interviewed as part of a 
series of studies in the prediction of adjustment. For the purpose 
of the present investigation, each teacher was asked to nominate 
the three best and the three poorest adjusted students (regardless 
in that school. The selection of high school teachers as 
mited the age range of the persons they 
then specifically asked: “Consider- 
nated as best adjusted, 


of sex) 
evaluators purposefully li 
evaluated. The teachers were 
ing the three people whom you have nom s 
bat is fhe toe them as a group or individually pes ers 
them stand out as best adjusted?” (The same was then asked with 
regard to the persons nominated as poorest adjusted.) 

The responses of the teacher-evaluators were recorded, content 
analyzed, and the frequencies of responses for each category tabu- 
lated according to the sex of the evaluator. Percentages were com- 
puted and are presented in Tables 1 and 2. A critical ratio formula 


* Thompson (22) reports that women teachers’ rankings agree somewhat 
better with those of child psychologists than do the rankings by men teachers 
(data however are not presented). Epstein's study (8) is also concerned with 
sex differences in ranking behavior problems of children. 


TABLE 1 


DIFFERENCES BETWEEN MEN AND Women Hicu Scuoon TEACHERS 


IN MENTION OF CnrTERIA OF Goop ADJUSTMENT 


Criteria 


3.0 Social-interpersonal (total) 


1.0 Emotional-personal (total) 


1.1 mature, good judgment, sensible, sound 

1.2 pleasant, likeable, sweet sunny disposition, good 
personality, personality 

1.3 adaptable 

1.4 general attitude is good, healthy attitude 
toward marriage, sex, surroundings, other per- 
sons 

1.5 are secure, don't show off, don't complain, not 
self-conscious 

1.6 well-integrated, emotionally stable, well-bal- 
anced, calm, not tense, poised 

1.7 accepts criticism, has insight into self 

1.8 vivacious, lively, enthusiastic, energetic, glow- 
ing 

1.9 happy 


2.0 Character-control (total) 


2.1 good manners, show respect for authority, atten- 
tive, courteous, polite, no discipline problems, 
well-behaved, accept mores of society — 

2.2 takes responsibility, accepts responsibility, as- 
sumes responsibility beyond call of duty, trust- 
worthy, dependable, reliable : 

2.3 honest, sincere, straightforward, genuine 

2.4 are not led, have principles, mind of their own, 
independent, self-sufficient, real strong individ- 
ualist 

2.5 keep themselves decent, 
good character, don’t get in 
conduct o.k., tries hard to 
narrow path 

2.6 humble, modest 

2.7 unselfish, altruistic, 
do good for others uu 

2.8 good at getting things done under trying Cir- 
cumstances, handles difficulties well 


have not wandered, 
to difficulties, moral 
follow straight and 


wants to serve fellow men, 


3.1 charming, meets people well, likes other people 
to be comfortable, ability to carry on conversa- 
tion, good mixer, friendly, extrovert, fits in well, 
gets along well, hosts of friends, well liked, pop- 


ular 


Mese 
Q = 17) 


5.8 


88.2 
70.6 


Male 
(N = 22) 
% 


27.3* 


oo 
=. o 
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TABLE 1—Continucd 


Female Male 
(N21) (N= 22 
Criteria % % 
3.2 leader 35.3 27.3 
3.3 cooperative, willing 11.7 13.6 
3.4 aggressive, lots of guts with people 0.0 9.1 
3.5 participation in extracurricular activities, com- — 23.5 9.1 
munity activities 
4.0 Achievement in general and specific cultural roles 88.2 72.7 
(total) 
4.1 ambition to make good, wants to get ahead, 58.8 36.4 
doesn’t let opportunities goby, are goal-oriented, 
know exactly what they want, planful, lots of 
determination, drive, initiative, sticks to things 
and accomplishes them 
4.2 good students, use their ability, like school, high 41.2 18.2 
scholastic standing, get good marks 
4.3 not necessarily the best or good students but — 11.7 9.1 
they try 
4.4 does good job, carries on job very successfully, 17.6 922.4 
terrific worker, tends to business, doesn't loaf, 
works well independently, industrious, efficient 
at job 
5.0 Abilities and interests (total) 52.9 45.4 
5.1 all round person, variety of interests 23.5 18.2 
5.2 have ideas, think for themselves, logical think- — 11.7 13.6 
ers, inventive, creative ' u 
5.3 bright, alert, capable, superior, brilliant 29.4 13.6 
6.0 Appearance (total) 5 4.5 


attractive, very gracious, very fine dresser, carries 
herself well, neat appearance 


* Significant at the 10% level of significance. 


for independent proportions (11, p. 60) was used to determine uig 
significance of the differences in response by sex to each category. 
The tables, then, indicate the proportion of each teacher group 
(male and female) that employed each criterion and where sig- 
nificant differences appear. The specific criteria are grouped into 
general areas: social-interpersonal, statements reflecting the re- 
lationship between the S and others; emotional-personality, state- 
ments which indicate adequacies or inadequacies in adjustment 
in which the social reference is lacking or minimal; character- 
control, statements which indicate the extent or adequacy with 


TABLE 2 


DIFFERENCES BETWEEN MEN AND Women HIGH SCHOOL TEACHERS 
IN MENTION OF CRITERIA OF POOR ADJUSTMENT 


Female Male 
Criteria i $1 e mo 

1.0 Emotional-personal (total) 82.4 81.8 
1.1 seeks attention, shows off, wants to be something 28.5 27.83 


he isn't, bluff exterior only 
1.2 persecution complex, suspicious of adults, feels 17.7 27.8 
people are picking on him 
1.3 dissatisfied with things as they are, looking for 5.8 4.5 
something they haven’t got 
1.4 emotionally unstable 41. 
1.4.1 unstable, inconsistent in moods 1 
1.4.2 violent temper, bad temper, uncontrolled, 
purely emotional, hot-headed, high-strung, 
quick-tempered 
14.3 nervous, can't settle down, restless, moves — 23.5 9.1 
around, tense, lives in constant fear 
1.5 resents suggestions from others, reacts poorly to 0.0 18.2 
criticism 
1.6 stands out as different from others, a queer ducka — 11.7 9.1 
little bit odd, strange 
1.7 insecure, lacks confidence in self, afraid to say 11.7 
wrong thing, seeks praise, ill at ease, always full 
of alibi 
1.8 stutters when excited, one complaint after another 
1.9 immature, acts like baby, kind of young 1. 
1.10 unhappy, gets no enjoyment from things average 1 
people do 
1 smiles inappropriately, 5 
2 makes fool of self, others feel sorry for him 
0 


La 
1.1 
1.13 daydreams, reads escapist books 
1.14 sissy, feminine characteristics, possibl 


silly sneer on face 


esex prob- 


lems 
1.15 no spark Bi 
1.16 sadistic, hurts animals 6 
2.0 Character-control (total) 82 
2.1 unreliable, not dependable, e E 0 
sibility, untrustworthy, tatler, hes 
2.2 hard kid ok Ie vers ecm, tate disciplinary 08.8  81.8* 
problems, negativistic, hostile to authority, defi- 
ant, cocky, smart alecky, belligerent attitude, in 
conflict with mores, rebels, poor attitude, chip on 
shoulder, wise guy, trouble maker, hostile, gets 
into trouble in school, mischief maker, poor con- 


duct 


ks sense of respon- 
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TABLE 2—Continued 


Female Male l 
Pee (N = 17) (N = 22) 
Criteria % % 


2.3 boisterous, foolishness and joking all the time, 35.3 22.7 | 
conduct themselves poorly in public, loud, noisy, 
discourteous, coarse no manners 
2.4 very dependent, not self reliant, can’t stand on 5.9 18.2 
own two feet, can’t make decisions, mamma’s boy, 
conformist, conforms precisely to accepted pattern 


2.5 runs with wrong crowd, easily led by others in 0.0 4.5 
crime 
2.6 too independent, goes own way, grows up as sees 11.7 0.0 
fit 
3.0 Social-interpersonal (total) 70.6 63.6 
3.1 lone wolves, withdrawn, unresponsive, don’t com- 35.3 — 45.5 


municate, shy, timid, very quiet, passive, aloof re- 
served, not comfortable with people, typical wall- 
flower, mousy, standoffish 
3.2 lack of adjustment to social groups, not liked, no 41.2 35.3 
friends, not accepted in group, not accepted by 
others, get along poorly, don’t attract people, not 
friendly with people 
3.3 are uncooperative with peers 0.0 
3.4 self-centered, snobs, envious of others, critical of 17. 
others, overbearing 
3.5 overly aggressive, domineering, too bossy, bully 5. 4.5 
4.0 Lack of achievement (total) 47.1 03.6 
4.1 nogoals, no desire to get ahead, no purposefulness, 23. 45.5 
doesn’t know what she wants, just rides along, just 
rolls along, no get up and go, no ambition to get 
anything done, lazy, no drive for self-improve- 
ment, can’t buckle down, lack ce don’t apply 
never stick to anything 
4.2 M nn contributed little at school, re- 23.5 27.7 
peating grade, no interest in school, aol m 
4,3 can’t read, can’t spell, trouble in expressing him- 


self | 
41.2 18.2 
50L ili j t (total 
ack of ability and interest ( ) not bright, 29.4 13.6 


5.1 lack of ability, low scholastic ability, 
slow to learn, doesn’t seem to understand any- 


m 


2 


e 


thing, not much on the ball a 
5.2 bored, uninterested, dull, inattentive 11.8 9.1 

6.0 Appearance (total) NE p 
6.1 physical handicap, long-term illess, physically 11.7 9.1 
6.2 not too attractive, slovenly appearance, dirty, 5.8 et 


personal habits poor, nothing glamorous, dresses 
inappropriately 


* Significant of the 10% level of significance. k. 
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which the individual has incorporated social controls into his rep- 
ertory and the traits developed in the process of doing so; abilities- 
capacities, statements referring to S's abilities, capacities, and 
skills; achievement, statements which indicate adequacy or in- 
adequacy with regard to achievement generally or to specific cul- 
tural roles, e.g, school, job; appearance, statements concerning 
personal appearance. 
Results and Discussion 


The reliability of the categorization process was determined for 
this series of studies employing two psychologically trained raters 
ercentage agreement for the classification of 


as classifiers. The p 
specific criteria was 85%. Agreement with regard to area classifi- 


cation was 91%. oo 5 : 
For six categories (out of 83) there appear significant sex dif- 


ferenees in frequency of response. Since the 10% level of signifi- 
cance has been taken as the cutoff point, it is apparent that the 
number of significantly differentiating items could have occurred 
by chance alone. The fact that the pattern among the differences is 
consistent with other results that have been obtained, however, 
suggests that these particular differences may be meaningful. The 
most significant observation to be made concerning the results, 
however, is that male and female teachers are to à high degree con- 
sistent in their employment of the criteria of adjustment. 

The differences which do emerge suggest that male high school 
teachers are more inclined to value maturity, good judgment, de- 
pendability and trustworthiness. They tend, as well, to charae- 
terize the best adjusted as secure persons and lacking in self- 
consciousness. 

Women high school teachers, on the other hand, place greater 
emphasis on those items classified together as "good character" 
items, particularly humility and modesty. In addition, and prob- 
ably most characteristically, women teachers place more stress 
upon negativism, hostility to authority (the teacher), discipline 
problems, etc. as criteria of poor adjustment. uu | 

As already indicated, in all other respects the criteria which are 
employed in describing the least and the best adjusted high school 
students are essentially the same for both men and women teach- 

quency of the criterion variables 


ers. In assessing the relative fre c 
(by ranking) for each sex by area, some of the findings already 
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indicated receive further support. The most important area men- 
tioned for both good and poor adjustment by women teachers are 
the “character-control” (essentially, classroom management) 
items. For the male teachers, however, the “emotional-personality” 
items receive prime mention. Achievement and ability variables 
are mentioned quite frequently (as much as 80%) by the high 
school teachers in characterizing those best or poorest adjusted. 
These results tend to lend support to the hypothesis that there 
may be some sex differences in adjustment criteria and in the 
direction suggested by Davis and McGinnis, namely, greater em- 
phasis by men on the emotional-personality criteria; greater em- 
phasis by women on the character-classroom management criteria. 
Considering the possibility these results could have occured by 
chance alone, however, makes these findings only suggestive. 


Summary and Conclusions 

Considerable agreement was found between men and women 
teachers in the criteria used to characterize best and poorest ad- 
justed high school youth. T 

The criteria which are used significantly more by men teachers 
are maturity, good judgment, dependability, trustworthiness, lack 
of self-consciousness, and being secure as à person, 

The criteria employed more characteristically by women teach- 
ers are humility and modesty. They also place greater stress in 
regard to poor adjustment upon negativism, apri ri authority, 
discipline problems, and getting into trouble in sc em 

In general, the "character-control" items an a g 0 oe 
often mentioned as criteria by women m emo pe 
sonality" items as a group by the men 2l — 

Intellectual and achievement variables play an A unm 
in the assessments teachers make of the adjustmen y 

eople. : 
li These results are considered in light of a sex difference pag 
sis concerning the differences between clinicians and teachers ob- 
tained in the Wickman study. 
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CHRONOLOGICAL DIVISIONS OF THE 
LIFE SPAN 


Horace B. ENGLISH 
Ohio State University 


. Most developmental psychologists object to the notion of dis- 
tinct “stages” of development. Yet it has been almost universally 
found convenient to describe individuals by such terms as child, 
adolescent, adult. To find, for the Dictionary of Psychological and 
Psychoanalytic Terms, how these should be defined, I sent à simple 
open-ended questionnaire to the fellows of the APA Division on 
Developmental Psychology, asking them to state the chronological 
limits applicable for these and similar terms. The freedom of ex- 
pression encouraged did not make for simplicity of tabulation; 
but, as hoped, a fairly clearcut consensus as to usage Was revealed. 
One hundred and sixteen questionnaires were sent out, 74 were 
returned. 


Table 1 sets forth the limits which best meet the consensus for 


each named temporal division, and the percentage of replies favor- 
ing those limits. In every case there is a mode in the distribution, 
but in a few cases a secondary mode shows that a divergent prin- 
ciple of categorization is being employed by a minority. In most 
instances, however, it is the adjacent categories which are next 
most favored, suggesting that the minority would easily accept 


the modal usage. . f 
The numerous comments are i teresting. “Neonate” is taken as 
“Newborn,” and many favor its abandon- 

essary. (Amen to that, and the same 
atin neologisms!) The *Gang Age" 
fers to a “stage” in development, 
if it means anything, which many respondents doubt. There is no 
consensus as to when it occurs and little desire to pin it down. 
“Puberty” is also rejected from this list; it is a physiological con- 
dition, not an arbitrary division of the life span to be defined by 
usage or consensus. Unfortunately, I assumed that adulthood and 
maturity would be used synonymously and did not list the former 
term. A few respondents, however, seem to need a period of “early 
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a complete synonym for 
ment as pedantic and unnee 
for a lot of similar Greek and L 
does not belong in the list; it re 
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TABLE 1 
Divisions or THE Lire Span 


Infancy Childhood 
Fr. Birth to 11.9 Mos. l Yr. to 11.0 
Yrs. 
(82%) (46%) (83% (41%) 
$ E Late Childhood 
Newborn Geo Early Childhood Mid-Childhood| (pre-adoles- 
nate) Birth to (young child) ~ 
6 to 9.9 Yrs. cence) 
0.9 Mo. 1 Yr. to 5.9 Yrs. (22%) (25%) | 10 to 11.9 Yrs 
22% ^ E: . 
(100%) (42%) (30%) (50%) (35%) (60%) 
Adolescence Maturity | Old Age 
12 Yrs. to 20.9 Yrs, 21 to 65 Yrs.| 65 Up 
(60%) (42%) (82%) (24%) | (47%) 
Early Adoles- | Mid-Adoles- Late Adoles- 


cence cence cence 
12 to 13.9 Yrs. | 14 to 15.9 Yrs. | 16 to 20.9 Yrs. 
(50%) (44%) | (50%) (45%) | (35%) (45%) 


— eee ea ie BRE, 


adulthood” between adolescence and maturity; they then define 
the latter as beginning at 30 or even 40. Most, however, are content 
to begin maturity with the twenties. Perhaps the Serontologists 
would be inclined to suggest a later age than 65 as the conventional 
E On ean comments, however, were those expressing 
reluctance to accept a chronological bounding of any of these pe- 
E ite rightly, they insist that no exact psychological mean- 
tods, i i a io most of these terms. That, it seems to me, 
de: iy e a merely formal definition in chronological terms 
is best, Tf "n example, one defines adolescence as beginning with 
ma hae tie seems to imply a specific pattern of psychological 
Taia “characteristic” of the stage. Such a clearcut stage prob- 
ably does not exist. Discussions based on such a usage mislead the 
layman and do not help the professional. — " 
Let us frankly admit that all the distinetions are ar : € 
There is not one of the life periods which could not perfectly w 
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have its limits shifted forward or backward. Is a “newborn” an 
infant of less than 10 hours, of 10 days, of 14 days, or of what? It 
doesn’t matter which we say except that it is better for us all to 
say the same thing. There should be strong reasons given if one 
elects to deviate from the consensus of expert usage here set forth. 


Received May 9, 1957. 


A COMPARISON OF READING COMPREHEN- 
SION SCORES OBTAINED BEFORE AND 
AFTER A TIME ANNOUNCEMENT 


Desmonp L. Coox 
Purdue University? 


One of the most commonly used methods for measuring the rate 
of reading is to ask the reader to indicate, at a given time signal, 
the point he has reached in a given reading selection or passage. 
Rate scores obtained by this method are usually expressed in terms 
of the number of words read per unit of time, or some similar ratio. 

A variation of the above technique is employed on the reading 
comprehension test given to all new students entering the Uni- 
versity of Iowa. This test consists of a series of 10 reading pas- 
sages, each followed by several choice type questions designed to 
measure various comprehension skills and abilities. The total 
number of items is 80, and the time limit for the test is 90 minutes. 
At the start of the test, examinees are informed that at the end of 
45 minutes (or when one-half of the time has passed) they will be 
asked to indicate on the answer sheet the last item marked. The 
item indicated serves as a measure of reading rate and is so re- 
xc validity of such a procedure may be open to question since 

'e would not be a measure of pure reading rate. It is a 
siii oe the time spent not only in reading the passage but in 
TOARE 2 d eae the questions following each passage, Thus 
reading ssl fur two students to mark the same item but yet have 
it is possible jleted different amounts of the test. For example, 
€ ae have just completed marking item 42 at the time 
student A on ok student B has answered item 42 and gone on to 
te ee e. Under the procedure of obtaining rate scores, 
“eee ES mark item 42 on the answer sheet. Neverthe- 
en A e hum demonstrated to be an adequate measure of the 
ess, E 

i dents read (1, 8). 
- : sepe to believe that those examinees who had been 
E. dil? prior to the time announcement would speed up 


iThe data for this study were obtained h hor Associate 
} a 0 i y btained when the author Md as " "c . 
irec ve Examinations Service, State University of Iowa, Iowa y: 

Director, £ y C wa 
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in order to complete the test within the set time limit. Such a 
speed-up could result in lower comprehension scores for this group 
due to the fact that not as much time could be given to careful 
reading of the passages and answering of the questions. The faster 
readers meanwhile would be further along in the test and would be 
less concerned about speeding up to finish the test. They would 
have sufficient time to finish the test, and, consequently, the com- 
prehension score would not be affected to the same degree as for 
the slow readers. It is even conceivable that the faster readers 
might improve their scores since more time could be devoted to the 
remaining items. 

The relationship between the rate of reading and comprehen- 
sion has been the subject of a rather large number of studies. 
Traxler (10), Traxler and Townsend (11, 12), Tinker (9), and 
Stroud (6) present rather comprehensive summaries evaluating 
the research on this problem. Stroud reports that the average cor- 
relation between speed and comprehension is around .40 when 
timed tests are used. As Traxler points out, this condition is to be 
expected if both measures are secured from the same timed test. 
Using untimed tests, Stroud and Henderson (7) found that for 
elementary school children there was virtually no correlation be- 
tween rate and quality of reading. Using both timed and untimed 
tests with college students, Preston and Botel (4) report a corre- 
lation of .20 for untimed tests and .48 for timed tests, the latter 
correlation being significant. Usually these studies relating rate 
to comprehension have been concerned only with the relationship 
between these two variables and have not reported on the accuracy 
of comprehension involved. 

Specific study of the accuracy of comprehension scores obtained 
before and after time announcement used to secure a rate score 
on a timed test appears not to have been investigated. Flanagan 
(2), Anderson and Tinker (1), Tinker (8), and Robinson and 
McCollum (5) have approached the problem by noting compre- 
hension accuracy in set time limits, but they do not report com- 
parative results. Flanagan studied the relationship between rate 
and comprehension using twelfth-grade students trained to read 
at varying rates. Comprehension was noted to decrease slightly 
as the students went from a slow to a medium rate and much more 
from the medium to a rapid rate. When the students were divided 
into three groups on the basis of the slow rate, each group showed 
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about the same decrease in comprehension scores from medium 
to rapid rates. The results were obtained, however, using scores 
for the full testing time, and no evidence was reported on relation 
of accuracy for items attempted. Anderson and Tinker report the 
results of a study concerned in part with the number of items at- 
tempted and scores on these items in a standard time (i.c., part 
time) for various parts of the Iowa Silent Reading Test. A corre- 
lation of .88 was found for the composite score with regard to 
scores in standard time with iterns attempted in the same time. 
The tests were administered individually to 110 college sopho- 
mores. No comparison of comprehension accuracy before and after 
the standard time announcement was reported. Using tests of 
varying reading difficulty, Tinker found that comprehension at- 
curacy in a standard time varied from no difference to an aceuracy 
of response of 60-70% for the most difficult test. In this study, 
the standard time limits varied Írom test to test and ran from 
1% minutes to 36 minutes with students being allowed unlimited 
time to finish the tests. Again no results were presented regarding 
the comparative accuracy of response before and after the stand- 
ard time announcement. A study of relationship between accuracy 
of response to number of items attempted on the Paragraph Com- 
prehension part of the Iowa Silent Reading Test for 37 good and 
33 poor readers was made by Robinson and McCollum. The groups 
were selected on the basis of their comprehension scores on the 
total test. It was found that the good readers were usually superior 
to the poor readers in both rate and accuracy. Essentially the 
same results were obtained when the groups were given the Van 
Wagenen “A” and “Alpha” tests which were unie v The pues 
concluded that efficiency in rate of reading is the "re v m 
tial factor in reading test superiority. As was the pon ~ i 

studies cited above, the comprehension oe aa T ks e = 
ported for items attempted in either a standar or tota rine e 
the accuracy of response on untimed tests, with no comparison o 
accuracy before and after a standard time announcement on & 


timed test. 
Purpose 


sel i ; Ter 

The purpose of this investigation is to compare Sd ee . 
scores obtained before and after a time announcement for A z 
of slow and fast readers. The specific hypotheses to be teste 
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that the mean proportion of items correct before a time announce- 
ment is the same as the mean proportion of items correct after a 
time announcement for each of the groups of readers. This is es- 
sentially the null hypothesis, i.e., that the mean difference between 
scores before and after the time announcement is zero for each of 
the experimental groups. 

It is possible, of course, that any change in performance might 
be due to factors other than the time announcement. One such fac- 
tor might be increasing item difficulty as one goes through the 
test. Another factor which might tend to depress comprehension 
scores is fatigue. No direct measure of fatigue was obtained. There- 
fore, its influence can be considered only on a somewhat specula- 
tive basis in this study. 

Rejection of the hypothesis at the 1% level of confidence for 
the slow readers but not for the fast readers may be interpreted 
as evidence in support of a position that the time announcement 
does have some effect on subsequent comprehension scores. The 
exploratory nature of the study justifies the setting of such a high 


level of confidence (1). 
Procedure 


Three groups of 100 answer sheets each were selected from ap- 
proximately 1,500 answer sheets for the reading comprehension 
test administered in September 1955 using the percentile values 
assigned to the rate score as the criterion of selection. 

'The firs& group (Group A) consisted of examinees whose rate 
seores were above the 90th percentile and who had completed the 
test. The other two groups (B and C) consisted of slow readers. 
Group B was made up of examinees whose rate scores were below 
the 10th percentile and who had completed the test. Group C con- 
sisted of examinees whose rate scores were below the 15th per- 
centile but who did not complete the test. The original intent was 
to select Group C from among the examinees in the lowest decile 
for the rate score, but the lack of a sufficient number of incom- 
pleted papers in this area necessitated selection from the area 
below the 15th percentile in order to secure 100 papers. It was as- 
sumed that the individuals in Group C, by not completing the test 
after the time announcement, either did not speed up or, if they 
did, they did so to a limited amount. This group was used as a sort 
of a “control” against which the performance of the slow readers 
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completing the test might be better judged. No corresponding 
group of unfinished papers was available for the faster readers 
since most of this group finished the test. 

For each examinee in the three groups, the number of items cor- 
rect before the time announcement was determined and expressed 
as à proportion of the items attempted in that period. A similar 
proportion was also determined for items correct of items at- 
tempted after the time announcement. A difference score was found 
for each examinee by subtracting the proportion correct before 
from the proportion correct after the time announcement. By this 
procedure, negative differences would indicate that the individual 
did not do as well (i.e., had a smaller proportion of items correct 
of items attempted) after the time announcement as before. The 
median number of items attempted before and after the time an- 
nouncement was obtained for each group separately. The ¢ test 
for related measures was used to test the significance of the mean 
difference for each group. No statistical tests of significance were 
made for differences between the groups. 


Results 


The results of the study are presented in Table 1. In this table 
are given the mean proportion of items correct of items attempted 
before the time announcement for each group, the mean propor- 
tion correct of items attempted after the time announcement, and 
the mean difference for each group. The median number of items 
attempted before and after the time announcement for each group 
is also presented in Table 1. 

The results for the fast readers completing the test (Group A) 


TABLE 1 
IrEMs BEFORE AND ArTER TIME ANNOUNCEMENT 


Median Number 


Attempted Mean Proportion Correct 
Before After Before After g 
Group Time time time time Dif. 
.55 —.04 
A (N = 100) 63 17 .59 
B a = 100) 37 43 .53 .39 m o 
C (N = 100) 33 31 .58 .39 —.19 


** | significant at 1% level of confidence. 


| 
| 


— Cu 
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showed that they do read quickly as evidenced by the fact that 
over one half of the examinees had completed about 79% of the 
items before the time announcement. For this group, the mean 
proportions of items correct before and after the time announce- 
ment were quite similar. The mean proportion of items correct 
before the time announcement was .59. After the announcement, 
the mean proportion of items correct was .59. 'The average per- 
formance after the time announcement was in a negative direction 
but not statistically significant at the 196 level. 

The results for the two groups of slow readers are quite similar. 
The slow readers completing the test (Group B) tended to speed 
up as predicted in that 5396 of the items were attempted after, 
as compared to 47% before, the time announcement. The slow 
readers not completing the test (Group C) apparently were not 
greatly affected by the time announcement, as they completed 
33 out of the 80 items before the time announcement but only 
31 of the remaining 47 items after the time announcement. The 
slow readers completing the test had a mean proportion of correct 
answers of .53 before and .39 after the time announcement. Slow 
readers not completing the test had a mean proportion of .58 
before and .39 after the time announcement. Both groups showed 
a quite similar “Jack of success” after, as compared to before, 
the time announcement, the difference being less for slow readers 
completing the test than for slow readers not completing the 
test. The difference in mean proportional values before and after 
E time announcement were statistically significant al the 196 
evel. 


Summary 


The purpose of this study was to compare accuracy of com- 
prehension scores secured for slow and fast readers before and 
after a time announcement used to Secure a measure of reading 
rate. The specific hypotheses tested were that the mean difference 
between scores before and after the time announcement was zero 
for each of the experimental groups. n 

Fast readers completing the test had a mean difference of 
—.04. Slow readers completing the test had a mean difference of 


~.14, while slow readers not completing the test had a mean 


difference of —.19. Statistically significant differences between 


446 


Desmond L. Cook 


the proportion of items correct of items attempted before and after 
a time announcement were obtained for groups of slow readers 
but not for fast readers. 


These results suggest that a time announcement used to obtain 


a rate score is associated with more lowered comprehension scores 
after such an announcement than before it for slow readers but 
not fast readers, as measured by proportion of items correct of 
items attempted in the two respective time limits. 
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UNDER- AND OVER-AGE GRADE-PLACE- 
MENTS AND SCHOOL ACHIEVEMENT 


Jack A. Hotes 
University of California, Berkeley 


AND Carmen J. FINLEY 


Sonoma School Districts, California 


es of investigations at the elementary 


school level designed to establish relationships between the degree 
of over- and under-age grade-placement and relative success in 
various school subjects. The first two studies, on Grades V and 
VI, have already been reported (3, 4). The present paper pre- 
sents data for Grades VII and VIII and then discusses the theo- 
vetieal significance of the findings. 

_ The hypothesis being tested holds that certain academic dis- 
ciplines at the elementary level are more important than others in 
contributing to nonpromotion, promotion, and acceleration. 


Methodology; "Tests 


and 312 girls and 
drawn from pupi 


This study concludes a seri 


Samples, 


" For Grade VII, 293 boys 
18 boys and 293 girls were 


elementary school districts in Sonoma County. . 
Grade-Placement Deviation (GPD) was determined by com- 


puting the difference between each child's Actual Grade-Place- 
ment (AGP) and the grade he should have been in according to 


his chronological age. This latter grade is defined as his Chrono- 
nent (CaGP). Grade-Placement Devia- 


logical-age Grade-Placer 
ion, then, is expressed as: 


GPD = AG 


Where K is a constant (5.0) inserted 


all GPD values a positive sign- - 
"Table 1 gives the frequency distributions of GPD's for each 
Sex in both grades. Inspection reveals that, according to their 


for Grade VIII, 
ls enrolled in 68 


p — CaGP + K 
into the equation to give 
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TABLE 1 


Frequency DISTRIBUTIONS or ACTUAL GRADE-PLACEMENT DEVIATIONS 


Grade VII Grade VIII 

Boys — Girls "Boys Girs 
Direction of Deviation GPD Interval F f f f 
(Under-aged) 2.3 + 2.80 0 1b 0 0 
(Accelerated) 1.7 + 2.2 0 0 0 0 
1.1 + 1.6 4 9 3 6 
5+ 1.0 49 72 43 70 
Normal =- a! fl 125 140 124 123 
--— mm 3 70 59 83 55 
—-18- 8 28 16 42 25 
—1.09 — 1.4 12 11 20 8 
—2.5 — 9.0 3 3 3 4 
(Over-aged) —3.1 — 2,6 1 1 0 2 
(Retarded) —-3.97— 3.2 1 0 0 0 
N 293 312 318 293 


a The constant K has been 


subtracted in order to bring these deviations 
from the mean back into the 


ir original form, 


rade girl is accelerated by +2.3 grades; there- 
fore, according to her chronological age she should have been in the (7.1 — 
2.3) or 4.8th grade. 


chronological ages, the seventh-grade boys should have been dis- 
tributed from Grades 5.7 to 10.4 and the Seventh-grade girls 
should have ranged from 4.8 to 9.7; yet, the entire group was 
actually in grade 7.1. 

Likewise, Table 1 shows that, in accordance wW 
logical ages, the boys and girls of the eighth g 
been enrolled in Grades 6.5 to 10.6 and 6.5 to 
yet, they were all in Grade 8.1. 

The California Achievement Test Battery, 
DD, was administered to all seventh- 
during the first three months of the s 
general county testing program in 195 
are: reading vocabulary, reading com 
soning, arithmetic fundamentals, m 
spelling. Reliability coefficients rang 


ith their chrono- 
rade should have 
11,2; respectively ; 


Intermediate Form 
and eighth-grade children 
chool year as part of the 
2. The tests in this battery 
prehension, arithmetic rea- 
echanies of grammar, and 
e from .85 to 95. 
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TABLE 2 


INTERCORRELATIONS OBTAINED ON 293 Boys IN Grape VII 


1 2 3 5 6 Y 
Variable GPD RV RC AR AF MG SP 


1. Grade Placement Deviation .450 .346 .318 .259 .364 .397 


(GPD) 
.150 .650 .533 .622 .704 


2. Reading Vocabulary (RV) 

3. Reading Comprehension .637 .585 .655 -670 
(RC) 

4. Arithmetic Reasoning (AR) „667 .509 .538 

5. Arithmetic Fundamentals „564 .496 
(AF) 

6. Mechanies of Grammar -540 
(MG) 


7. Spelling (SP) 
8 6.15 6.70 6.83 


6.98" 6.98 6.75 6.7 
04 1.80 1.65 


Mean 
.68 1.60 1.39 1,0L 1. 


Standard Deviation 


^ Reconverted: K-Constant Subtracted. 


Findings 


Grade VII 
Table 2 presents the intercorrelations found among the above 
variables and Grade-Placement Deviation for the boys. The 
eriterion correlates highest with reading vocabulary (r — 450) 
and lowest with arithmetic fundamentals (r = 259). : 
The matrix in Table 2 was submitted to the Wherry-Doolittle 
Test Selection Method (7) in order to determine which of the 
test variables made an independent contribution to the variance 
of the pupils’ GPD's. The multiple correlation (R) rises as the 
variables, reading vocabulary, and repere) P — 
mar, are successively selected vely = e 4 u 5 en 
decreases if any of the other tests are added. Toget t ve three 
tests yield a multiple & of 46, significant at better than the 176 
level l T 
Table eins the intercorrelations for the girls. The criterion 
(GPD) correlates highest with reading eor d 4 e =) 


and lowest with arithmetic fundamentals ( 
Doolittle analysis selects only reading vocabulary and spelling 


450 Jack A. Holmes and Carmen J. Finley 


TABLE 3 


INTERCORRELATIONS OBTAINED ON 312 GIRLS IN Grape VII 


1 2 3 "m 6 7 
Variable GPD RV RC AR AF MG SP 
1. Grade Placement Deviation .980 .355 .278 .233 .269 .332 
(GPD) 
2. Reading Vocabulary (RV) -770 .569 .477 .644 .619 
3. Reading Comprehension -636 .578 .703 .670 
(RC) 
4. Arithmetic Reasoning (AR) -G18 574.489 
5. Arithmetic Fundamentals .572 .492 
(AF) 
6. Mechanies of Grammar (MG) .598 
7. Spelling (SP) 
Mean N 7.13" 7.18 7.23 6.75 6.40 7.30 7.62 
Standard Deviation -68 1.40 1.34 84 1.16 1.27 1.08 
^ Reconverted: K-Constant Subtracted. 
TABLE 4 
INTERCORRELATIONS OBTAINED ON 318 Boys IN Grave VIII 
Intercorrelations wae 
1 2 3 4 E 
Variables GPD RV RC aR b o d 
1. Grade Placement Deviation .434 .440 .430 .327 .381 .439 
2. Reading Vocabulary (RV) -781 -691 -564 667 699 
3. Reading Comprehension -723 .617 .651 619 
(RC) ! 
4. Arithmetic Reasoning (AR) .451 .612 .549 
5. Arithmetic Fundamentals -634 517 
(AF) 5 
6. Mechanies of Grammar .553 
(MG) 
7. Spelling (SP) 
Mean 7.88" 7.78 7.51 7.75 7.05 7.40 7.58 
Standard Deviation -68 1.56 1.57 1.31 1.46 1.38 1.74 


a Reconverted: K-Constant subtracted, remainder algebraicly added 
to 8.1. 
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TABLE 5 
INTERCORRELATIONS OBTAINED ON 293 Grats IN Grape VIII 


Intercorrelations 


1 2 3 4 5 6 7 

Variables GPD RV RC AR AF MG SP 

1. Grade Placement Deviation .348 .365 .302 .298 .279 .327 

2. Reading Voeabulary (RV) .774 .641 .571 .652 .634 

3. Reading Comprehension .688 .647 .654 .655 
(RC) 

4, Arithmetic Reasoning (AR) .786 .665 .580 

5. Arithmetic Fundamentals .056 .572 
(AF) 

6. Mechanics of Grammar .600 
(MG) 


7. Spelling (SP) 
Mean m He 


8.11" 8.18 8.03 7.85 7.65 8. 
Standard Deviation 1. 


“71 1.49 1.41 1.28 1.62 


^ Reconverted: K-constant subtracted, remainder algebraicly added to 


8.1. 
ions to the variance of 


as variables making independent contribut : 
the two tests yield an 


GPD's for seventh-grade girls. Together, 
R of .393, significant beyond the 1% level. 


Grade VIII 
Table 4 presents the intercorrelations for the boys. nep = 
relates highest with reading comprehension (r = 440) and lowes 


with arithmetic fundamentals (r = 327). The multivariate anal- 
spelling, and arithmetic rea- 


ysis selects reading com prehension, n 
soning. comis wm qm yield an R of .50, significant beyond the 
196 lev ce. 
eR UR intercorrelations for the girls. eid cor- 
relates highest with reading comprehension rs .365) i - 
with mechanics of grammar (7 = 319). Analysis (7) bd Pu E ‘ 
ing comprehension, spelling, and reading vocabulary. Together, 
these yield an & of .38; significant at the 1% level. 


Discussion 


e the results obtained in Grades VII and 


I : 
ot es Grades V and VI already published 


VIII with those found for 
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TABLE 6 
Summary or SELECTED TESTS AND CUMULATING R’s ron 
Grapes V, VI, VII, AND VIII 


Boys Girls 
Grade i 
Tests Samal á Tests poss i 
V | Reading Vocabulary -248 | Spelling 170 
Spelling -268° | Reading Vocabulary 183 
Mechanics of Grammar | .275^ | Arithmetic Fundamen- .190 
tals 
VI | Spelling -365 | Reading Vocabulary .302 
Arithmetic Reasoning  .393 Spelling .821 
Mechanics of Grammar | .394 Reading Comprehen- — .326 
sion 
VII Reading Vocabulary -447 | Reading Vocabulary 377 
Spelling 458 Spelling .393 
Mechanics of Grammar .464 
VIII | Reading Comprehen-  .437 Reading Comprehen-  .361 
sion Sion 
Spelling -483 | Spelling .975 
Arithmetic Reasoning  .497 | Reading Vocabulary .979 


a Cumulative & for Reading Vocabulary + Spelling. 
> Cumulative R for Reading Vocabulary + Spelling + Mechanics of 
Grammar. 


(3, 4), a summary of the selected tests and the cumulative 
multiple correlations found for both sexes at all four levels is 
presented in Table 6. Inspection of Table 6 reveals that: à 

1. Spelling is selected as an important test making a statistically 
independent contribution to the variance of GPD in all four 
grades for both boys and girls. The same cannot be said for any 
other variable in the present battery. 

2. Reading vocabulary and spelling are the two most impor- 
tant determiners of GPDs in Grades V, VI, and VII. 

3. The cumulative Ẹ tends to increase from grade to grade. 

4. The cumulative & is greater for boys than for girls at each 
grade level. 
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TABLE 7 


RANK-ORDERS FOR SCHOOL SUBJECTS PLAYING Most TO LEAST IMPORTANT 
ROLES IN DETERMINING GRADE-PLACEMENT DEVIATIONS 


Calculated from College Estimated from the 
Students’ Estimates R Calculations 


CO ID a eo o n 
Grades V, VI, VII 
CONMNNMNENENEEEEMmE M — 


(N = 75) (Ns = 979 + 869 + 611) 

1. Reading Comprehen- 1. Reading Vocabulary 
sion 2. Spelling 

3. Mechanics of Gram- 


2. Reading Vocabulary 


3. Arithmetic Funda- mar 
mentals 4. Reading Comprehen- 
4. Arithmetic Reasoning sion 
5. Spelling 5. Arithmetic Reason- 
6. Mechanics of Gram- ing 
mar 6. Arithmetic Funda- 
mentals 
-— —— M 
Grade VIII 
(N = 75) (N = 611) 
1. Reading Comprehen- —————————— 1 Reading Comprehen- 
sion sion 
2. Arithmetic Reasoning 2. Spelling. : t 
3. Reading Vocabulary 3.5 Arithmetic Henson 
4. Mechanics of Gram- ing 
mar 3.5 Reading Vocabulary 
5. Arithmetic Funda- 5. Arithmetic Funda- 
mentals mentals 
6. Spelling 6. Mechanics of Gram- 
mar 


eo Uo SC 
r is selected as an important subject 


d VII, but this is not so for the girls. 
hievement in the quantitative 


5. Mechanics of gramma 
for boys in grades V, VI, an 


6. Individual differences in ac 1 i 
subjects (arithmetic reasoning and arithmetic fundamentals, in 


which boys are supposed to excel) appear to play almost no role 
in determining whether or not à student shall be retarded, pro- 
moted, or accelerated in the elementary schools of this system! 

7. Individual differences in achievement in the linguistic sub- 
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jects (reading, spelling, and grammar, in which girls are supposed 
to excel) have a high premium in regard to GPD. 


Could these results have been logically predicted? 


Perhaps. The fact is, however, they were not! A class of 75 
university students enrolled in a teacher-training program was 
asked to rate these subjects in the order of their importance for 
determining GPD’s in Grades V through VIII. 

Table 7 presents the rank orders obtained by preferential anal- 
ysis of the estimates of these university students and compares 
them with those obtained from a preferential analysis of the Js 
of the four grades. The university students stress comprehension 
and relegate spelling to a low order of importance. Their dis- 
agreement with the facts is readily apparent in the almost random 
relationship of the rank orders of the entries in the two columns, 
especially for Grades V, VI, and VII. 


Theoretical Implications 


In essence, these studies recall Keys’ observations (5, 6) that 
acceleration is rare; indeed, the number of pupils that are ac- 
celerated in any one grade represent only a fraction of those 
retarded. The writers agree with Keys’ contention that this docs 
not make for the optimum learning situation for the bright, the 
average, or the dull. The usual argument for the policy of non- 
acceleration is that the child should be kept with his chrono- 
logical-age group. Such a social milieu is supposed to foster both 
maximum mental health and learning. However, these data clearly 
show that the desired homogeneity for chronological age (or for 
that matter, subject-matter achievement) is simply not obtained. 

Baker (1) and Baker and Traphagen (2) concluded from their 
studies that often bright pupils may fail in arithmetic, and spelling 
failure may occur at any level. The relationship of their findings 
to the present study is clear: bright or dull failure in arithmetic 
is not important for GPD; and spelling failure, which they found 
occurs at any ability level, also occurs at any grade level. Its 
occurrence is of great importance for GPD. . 

The theoretical implications of the present series of — 
tions concerning school-grade failures scem clear. The ese 
should come out openly and tell the students and their paren 
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that success in reading comprehension and arithmetic reasoning 
is not nearly as important as success in reading vocabulary, spell- 
ing, and mechanics of grammar for determining retardation, pro- 
motion, or acceleration by the time the child is in Grades V, VI, 
and VII. Or, the schools should make it their business to give a 
more cquable distribution to the weights they place on success 
in each of the subject-matter fields. 

As a matter of fact, for neither the boys nor the girls, in Grades 
V, VI, VII, and VIII, do individual differences in their combined 
achievement in these six subject-matter areas (i.e., reading vo- 
cabulary, spelling, mechanics of grammar, reading comprehension, 
arithmetic reasoning, and arithmetic fundamentals) contribute 
more than 25% to the variance of the grade-placement deviations 
within any one class. On what basis, then, one may ask, are 
students being grouped together in any one grade? On what basis 
are they being promoted, retarded, or accelerated? 

In a scientific age, the apparent unimportance of success in 
arithmetic is perplexing. This condition is no doubt widespread 
in the nation’s schools, and it is quite possible that this is one of 
the major reasons why officials of the armed services complain 
that a great number of men are not well grounded in arithmetic. 

If these data are, in fact, at a 


ll representative of elementary 
schools in general, they point up à pressing need for a re-evalua- 
tion of the relative importance of the courses offered in the school 
curriculum. 

In an age of interplanetary exploration, the outcome of the race 
for world leadership may not depend so much upon the ability 
of a nation to successfully project à satellite into a precaleulated 
orbit, nor even its ability to launch intercontinental missiles, but, 
as Professor Edward Teller has so clearly stated, upon its ability 
to simply take and hold the lead in scientific exploration and pro- 
duction, If the results of the above experiment are at all wide- 
spread, and there is reason to believe that they are, then it appears 
that there must be a revamping of our value judgments on what 
in the curriculum is of the most worth. More than that, knowledge 
for its own sake, especially in mathematics and science, must be 
much more enthusiastically supported and evaluated as important 
accomplishments for children in our elementary schools. 
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ITEM DIFFICULTIES BASED ON END 
SEGMENTS 


Epwanp L. CLARK 
Northwestern University 


In the analysis of test items, there is a strong tradition, origi- 
nated by Kelley (3), for using the proportion of right answers in 
the top and in the bottom 27% of Ss when ordered by a criterion. 
These end groups are used to determine the index of discrimina- 
tion of the items, and some writers recommend the use of these 
end groups, neglecting the middle 46%, for determining the dif- 
ficulty of the items (1). This use of the top and the bottom 
“quarters” of the criterion group for determining the diffculty 
of test item fails to distinguish between those items which can 
be answered by only the top 27% and those items which can be 
answered correctly by as many as the top 73%. Feder (2), using 
the responses of 144 Ss to the 70 items of a classroom test, has 
offered evidence in support of determining item difficulty by the 
end “quarters” of the criterion group. He concluded not only 
that the difficulty could be determined by the end groups, but 
also that the usefulness of this method was not related to the 
discrimination index of the item. 

The purpose of the present study was to compare, by correla- 
tion, the indices of difficulty as determined by the end quarters 
of the criterion group with those as determined by the middle two 
quarters. In order that the effectiveness of the items might be a 
variable in the study, correlations between difficulties were made 
for items of five different levels of discrimination. | 

The data used in this investigation were 1911 item analyses 
which had each been based on 400 Ss. These records had been 
accumulated over a 12-year period as two scholastic aptitude 
tests were being constructed: the Clark Analogies Test, in parallel 
forms and based on a very easy vocabulary, and a second test 
based on a difficult vocabulary. All items were of the multiple- 
choice type. The criterion most often used in these original an- 
alyses was total test scores. At times it was college grades or a 
mixed criterion of test scores and grades. The poorest items were 
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TABLE 1 


DIFFICULTIES as DETERMINED BY TUE Two Exp QUARTERS 
AND BY THE MippLE Harr or Groups 


Phi No. of item End groups Middle groups SD 
Cod. analyses Man SD — Man SD r of all 
0-.24 278 115.0 51.94 114.0 55.89 .984 —— 107.38 
.25-.39 816 124.0 36.08 122.8 48.83 .977 85.03 
.40-.49 497 118.6 29.260 120.9 39.07 .956 67.59 
.50-.59 245 113.6 21.15 116.4 22.20 .912 42.42 
-60-up 75 107.7 16.30 105.1 27.40 .872 42.38 


Note.—The means are the mean number of correct, responses to items 


by 200 Ss, while the correlation coefficients are between the two determina- 
tions of difficulty for each item. 


generally analyzed but once; the better items were analyzed with 
several groups of Ss, Although five different indices of diserimina- 
tion were used at different times, the only index used throughout 
the period was the phi coefficient based on the number of right 
answers made by the highest 100 and the lowest 100 Ss. Con- 
sequently, for the purposes of this study the phi coefficient was 
used to classify the items as to level of discrimination. 

The first line of data of Table 1 concerns those items which 
yielded phi coefficients of less than .25, or items which did not 
discriminate between the highest and lowest Ss of the 
groups. In these 278 analyses of items, the end groups of criterion 
Ss made right answers in almost the same manner as did the 
middle groups. The mean and standard deviation of right answers 
per item were much the same for each group of 200 Ss, and the 
correlation between them was 7 = .984. The standard deviation 
of the number of right answers made by the entire group was 
107.4. These items varied greatly in difficulty. 

For items of all levels of discrimination, the average number of 
correct responses is somewhat above 100, out of a possible 200, 
but for the items of high discrimination level the standard devia- 
tions are much lower. Items which diseriminate well between 
strong and weak Ss not only have an average difficulty near 50% 
but individualy are much closer to this level of difficulty. 

Some importance should be attached to the fact that the stand- 
ard deviation of the number right per item for the total group 
of 400 is more than twice as great as the corresponding standard 


criterion 
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answers of the end groups. The greater 
relative variability of item difficulties suggests strongly that more 
information is being given by the entire group than by the ends 
Since the standard deviations for the middle group were 
arger than for the ends, the middle half of Ss gave 
of difficulty for a majority of the items. How- 

in this study are the several coefficients of 
he difficulties as determined by the end 
While these correlations are very 


h the increasing discrimina- 


deviations based on the 


alone. 
consistently 1 
better evaluations 
ever, of more concern 
correlation between tl 
groups and the middle group. 
high, they get progressively lower wit 
tion level of the items. In passing, it should be noted that the use 
of 400 Ss in each analysis gave rather stable halves and allowed 
the highest correlations to approach 1.00. The use of 800 Ss would 
doubtless have given even higher correlations and the use of only 
100 would certainly have given lower. But the important finding 


in this study is that the use of the end “quarters” alone for de- 
termining item difficulties becomes progressively less satisfactory 
as one deals with increasingly better items. 

The use of the proportion of right answers in 


c the index 
of the eriterion group 10" 


the end segments 

iscrimination 
i Jtem iod, ut x ad de ge 
ot pre EH LT justification for Tei 
al ieu machinery) S "m ifficultes ' 
i lr at hand {or eter i one's items 4 de 
giis eo OON of Ss are used a ani quit gerat \ 

e Meee value, the item dil icul MEL 

&roup, d from the top and the bottom i 
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TABLE 1 
DIFFICULTIES as DETERMINED py THE Two Exp QUARTERS 
AND BY THE MippLE Harr or Groups 


Phi No. of item End groups Middle groups SD 
Cocf. analyses Men SD "us sa r of all 
0-.24 278 115.0 51.94 114.0 55.89 984 107.38 
.25-.39 816 124.0 36.68 192.8 48.83 .977 85.03 
.40-.49 497 118.6 29.26 120.9 39.07 .056 67.59 
.50-.59 245 113.6 21.15 116.4 22.90 .912 42.42 
-60-up 75 107.7 16.30 105.1 27.40 .872 42.38 


3 i S 
Note.—The means are the mean number of correct responses to item 


by 200 Ss, while the correlation coefficients are between the two determina- 
tions of difficulty for each item. 


generally analyzed but once; the better items were analyzed with 
several groups of Ss, Although five different indices of discrimina- 
tion were used at different times, the only index used throughout 
the period was the phi coefficient based on the number of right 
answers made by the highest 100 and the lowest 100 Ss. Con- 
sequently, for the purposes of this study the phi coefficient was 
used to classify the items as to level of discrimination. 

f Table 1 concerns those items which 
yielded phi coefficients of less than -25, or items which did not 
discriminate between the highest and lowest Ss of the criterion 
groups. In these 278 analyses of items, the end groups of criterion 
Ss made right answers in almost the same manner as did the 
middle groups. The mean and standard deviation of right answers 
per item were much the same for each group of 200 Ss, and the 
correlation between them was r = .984. The standard deviation 
of the number of right answers made by the entire group was 
107.4. These items varied greatly in difficulty. 

For items of all levels of discrimination, the average number of 
correct responses is somewhat above 100, out of a possible 200, 
but for the items of high discrimination level the standard devia- 
tions are much lower. Items which discriminate well between 
strong and weak Ss not only have an average difficulty near 50% 
but individualy are much closer to this level of difficulty. 

Some importance should be attached to the fact that the stand- 
ard deviation of the number right per item for the total sot 
of 400 is more than twice as great as the corresponding standa 
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deviations based on the answers of the end groups. The greater 
relative variability of item difficulties suggests strongly that more 
given by the entire group than by the ends 
ard deviations for the middle group were 
consistently larger than for the ends, the middle half of Ss gave 
better evaluations of difficulty for a majority of the items. How- 
ever, of more concern in this study are the several coefficients of 
correlation between the difficulties as determined by the end 


groups and the middle group. While these correlations are very 
h the increasing discrimina- 


high, they get progressively lower wit! 
tion level of the items. In passing, it should be noted that the use 
of 400 Ss in cach analysis gave rather stable halves and allowed 
the highest correlations to approach 1.00. The use of 800 Ss would 
doubtless have given even higher correlations and the use of only 
100 would certainly have given lower. But the important finding 
in this study is that the use of the end “quarters” alone for de- 
termining item difficulties becomes progressively less satisfactory 
as one deals with increasingly better items. 
The use of the proportion of right answers in 


of the criterion group for determining the index of discrimi 
method, but this fact, in this age 


is admittedly an approximation r : 
of card-sorting machinery, seems n for not using 
all the data at hand for determining item difficulties. However, 
if large numbers of Ss are used and if one’s items are of little 
discrimination value, the item difficulties may quite accurately be 
determined from the top and the bottom “quarters” of the criterion 


group. 
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THE INTERRELATIONSHIP OF AREA TEST 
SCORES AND COOPERATIVE GENERAL 
CULTURE TEST SCORES 


Joun C. Wricur 
DePauw University 
AND Barron B. SCARBOROUGH 
Florida State University 


In the fall of 1954, Educational Testing Service (ETS) made 
available the Area Tests, These tests were “... designed to assess 


the broad outcomes of education in the liberal arts” (4). A descrip- 
tion of the tests further stated that“... the tests have been aimed 
beyond the details of specific courses in an attempt to reach to- 
ward those goals of education which are shared generally by the 
colleges offering instruction in the liberal arts" (3), The Cooper- 
ative General Culture Test, also available f rom ETS, was designed 
^... to provide an indication of the Student/s general background 
in five important areas of the college curriculum” (4), 

In February 1955, the Area Tests were administered at DePauw 
University in an effort to gain an over-all estimate of student 
progress in a liberal arts setting. These tests Were administered 
since an estimate was desired of student progress in acquiring a 
way of thinking and dealing with problems which transcended the 
mere recall of a volume of relatively isolated academic informa- 
tion. The Cooperative General Culture Test was not used since 
there was some evidence to indicate it merely attempts to assess 
the student’s knowledge of a wide range of academic, factual in- 
formation (2). The Cooperative General Culture Test is not con- 
sidered an adequate measure of liberal education (1). 

Inasmuch as the Cooperative General Culture Test had been 
administered two years previously to the same group of students, 
there was a question of the distinctiveness of the two instruments. 
Therefore, this study was undertaken to see what ste 
existed between the scores of students on the Area Tests eie 
scores of students on the Cooperative General Culture Tes 
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DePauw University. Seemingly, the two tests were designed for 
quite different purposes. 


Procedure 


In February 1955, the Area Tests (Form I) were administered 
to the senior class of DePauw University (V = 269). These tests 
were returned to ETS for scoring. Scaled scores were furnished 
for each student. Of the 269 seniors who took the Area Test, 231 
were administered the Cooperative General Culture Test in Feb- 
ruary 1953. 

Conditions of administration were comparable in that the same 
room was used, both tests were part of a university testing pro- 
gram, and the test administrator in each instance had had years 
of group testing experience. 

Product-moment correlations were computed for the scores of 
each of the subtests of the Cooperative General Test and the re- 
lated area of the Area Tests. Thus, the scores for Current Social 
Problems and History and Social Studies were correlated with 
the score for the area of Social Studies. The scores for Literature 
and Fine Arts were correlated with the score for the area of 
Humanities, The scores for Science and Mathematics were cor- 
related with the score for the area of Natural Science. The inter- 
correlations were obtained for each sex and for the sexes combined. 


raw scores were used for the Cooperative General 


In all instances 
ores used for the Area Tests 


Culture Test scores and scaled se t 
scores, Raw scores for the Area Tests were not available. 


Results 


ation indices are shown in Table 1. 
fficients were higher for the women 


fficients for the women was below 


The intercorrel 
In general, the correlation coe 


than for the men. None of the coe r l 
.60 while two of these were in the seventies. The lowest single 


coefficient was .53. This index, which was for the men, was between 
Mathematics and the area of Natural Science. Both of the co- 
eflicients for the area of Humanities for men were .58. All other 
Coefficients for both men and women were in the sixties. 


Discussion 


The obtained coefficients of interrelationship of the Area Tests 
and the Cooperative General Culture Tests were, in general, rela- 
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TABLE 1 
CORRELATIONS or AREA TEST Scores anp G 


ENERAL 
CULTURE Test SCORES 


Senior Senior 
Men Women 
Area Test (N = 123) (NV = 108) 


General Culture Test 


Current Social Problems 


Social Studies .64 .06 
History & Social Studies Social Studies 64 75 
Literature Humanities .58 1 
Fine Arts Humanities .58 72 
Science Natural Science .66 .66 
Mathematics Natural Science .53 .60 
tively high. The coefficients were 


e not as high as might be expected 
for a two-year interval on a test-retest basis for either test. 

Many possibilities exist which might account for the obtained 

ion of certain facts is a pre- 


ther facts, certain factors are not 
st used ( 


evidence su 


ggests further 
the two ins 


truments, 
Summary 

The Area Tests and the Cooperative General 

designed for somewhat different purposes. The f 


Culture Tests were 
to assess the broad outcomes 


) ormer was designed 
of a liberal arts edueati 
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of possibilities might account for the i 
dence seems sufficient to warrant furt 
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ndices. However, the evi- 
her investigations of the 


distinctiveness of the two tests. 


e 


A 


. Buros, O. K., (Ed. 
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. Educational Testing Service. Catalog of ci 


References 
) The third mental measurement yearbook, New 


Brunswick, Rutgers Univer. Press, 1949. 
easurement yearbook, New 


. Buros, O. K. (Ed.) The fourth mental m 


Jersey: Gryphon Press, 1953. 
tutional testing program; handbook for 


J.: Author, 1954-55. 


deans and examiners. Princeton, N. $ : 
ooperative tests, Princeton, 


N. J.: Author, 1956. 


Received October 9, 1907. 


Book Reviews 
DANIEL Brower, & Lawrence E. Amr. Progress in Clinical 
Psychology. New York: Grune & Stratton, 1956. Pp. 364. 

This volume fulfills the promise made in the first, which pre- 
sented progress in clinical psychology through 1951. This study in- 
cludes primarily the years 1952, 1953, and 1954, with several sub- 
jects not appearing in the first volume going back as far as 1946, 

The introduction deals with clinical psychology from a trans- 
actional point of view; Part 2 includes discussions of the Rorschach 
method, thematie apperception and fantasy tests, and validity of 
sentence completion tests and human figure drawings. Part 3 dis- 
cusses therapy in relation to psychoanalysis, client-centered ap- 
proach, group methods, reading and other subject disabilities, 
speech disorders, play, music, art, bibliotherapy, psychodrama, 
hypnosis, and dreams. Part 4 reviews clinical psychology in cor- 
rectional institutions, rehabilitation of the physically handicapped, 
and administrative problems in state institutions. Part 
problems of neuropsychology, statistics and research. 

The new subjects not included in Volume 1 are h 
speech, reading, and some specialized therapeutic techniques. 

The very condensed nature of the studies may be seen from the 
fact that one sentence may have from three to six references, and 
eight pages of text may have six pages of bibliography. Many state- 
ments are references to other summaries, Including all the bibliog- 
raphies at the ends of chapters, there are 1560 references. The 
ten subjects with the most extensive bibliographies are the fol- 
lowing: dreams (178 references); group psychotherapy (167); 
rehabilitation of physically handicapped (163) ; statistics, reading, 
ete. (each 155); neuropsychology (114); psychoanalytic techniques 
(75); play, etc. (70); research (64); hypnosis (60). These numbers 
must not be taken as a measure of anything, but they perhaps 
suggest the interest of workers in the past few years as represented 
by the many authors of the present volume—something which 
might have been considerably different if there had been other 
writers. 

While giving considerable amounts of information, the study 
primarily refers the advanced student to most of the best in the 
field. This is not for beginning students, but implies a foundation 
in the subject. Written clearly in general, it shows some of the 
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5 takes up 


ypnosis, dreams, 
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ambiguity, paradoxes, and contradictions now existing. Even with 
these writers we do not quite get away from “faculties” and "fixed 
tools of thought." Some of the ambiguity in the field of clinical 
psychology secms, in the language of the psychology of learning, to 
have transferred. 

In general these are scholarly reports by specialists writing 
in their separate fields of interest. and bring the reader up to 
date (through 1954) in clinical psychology, probably as carefully 
and accurately as the difliculties of the subject matter permit. : 

Of special merit are the wide scope of bibliographical material, 
excellent eritieal comments, and suggestions for improvement in 
procedure and for further research, warnings against faulty and 
unscientifie methods, and indications of the gaps that need more 


attention and filling in. 
A good index of subjects completes the volume. 
A. 8. Epwarps 


The University of Georgia 


Friepa Froma-Rercumann, & J. L. Moreno (Eds.) Progress in 
Psychotherapy, 1956. Grune & Stratton, 1956. Pp. 352. 

This is a presentation by about 45 writers and collaborators of 

work, and progress of the various schools of 


the philosophies, 
ew out of the “program offered by our new 


psychotherapy. It gr 
Section at the annual meeting of the American Psychiatric Asso- 


ciation in Atlantie City, 1955.” Additions were made to include 
schools which were not heard at the meeting. It is hoped that it 
will help “to offer a platform for unprejudiced oral and written 


presentation and comparison of data by representatives of all 


schools of psychotherapeutic thinking.” 
Part 1 presents something of the history and philosophy of 


psychotherapy. Part 2 deals with principles of psychotherapy. 
Part 3 presents statements by workers in psychotherapy who 
indicate the nature and methods each of his own school. Part 4 
gives summaries of developments in seven foreign countries. Part 5 
is a summary of the present status and possible future of psycho- 
therapy. T 
Part 3, dealing with various schools, includes diseussions on 
the rediscovery of the patient, Adlerian psychotherapy, psycho- 
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biologic psychiatry, Jung’s concepts, active analytic therapy, 
existential analysis, psychodrama, the work of Sullivan and 
Fromm, the views of Horney, communication, biodynamic inte- 
gration, client centered therapy, multiple therapy, hypnotherapy, 
countertransference, problems of schizophrenia, group psycho- 
therapy, and prevention and treatment for psychotic children. 

An immense number of techniques are indicated; in fact, in his 
summary, Moreno states that “One hundred and forty-two tech- 
niques are mentioned in this book.” They may, according to the 
same author, be classified under seven headings: “physical, pharma- 
cologic, hypnotic, individual, group, action, and technological 
psychotherapy.” The immense sweep and inclusiveness of the 
subject matter is matched by general excellence of most of the 
articles. This might be expected since each school in the major 
presentations is discussed by a selected representative and worker 
in that particular school. For the foreign countries, Austria, Great 
Britain, France, Germany, South America, Spain, and Switzerland, 
the summaries are generally written by one person. 

In general, there appear to be very frank and humble state- 
ments with clear recognition, at least by many, of how little we 
know and how far we have to go in understanding psychotherapy. 
It may appear to many readers that the book contains an amazing 
collection of unverified concepts and reified hypotheses. Many 
clear distinctions are made, but we can even find suggestion that 
psychotherapy and psychoanalysis are indistinguishable. There 
are many statements of problems, successes and failures; but there 
are also suggestions of arrogance existing in the field, of unaccept- 
able dogmatisms, and broad generalizations on insufficient data. 
In the multiplicity of attempts to find foundations for psycho- 
therapy, it is refreshing to find Masserman referring to sound 
experimental work. In some cases it would seem that experimental 
psychology and general psychology of the textbooks were treated 
with thorough disregard if not with contempt. Some formulations 
seem to be straight from the armchair. 

It is pointed out that practice is probably better than theory 
and definition, and that much of successful therapy lies in the 
personality and competence of the therapist. The heavier weight- 
ing of the volume is on psychoanalytic concepts; rather little 
recognition is given to interpretation in terms of learning. The 
experimental psychologist reading this book might like to ask how 
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far astray can we get when imagined entities are substituted for 
experimental facts? He would not be favorably inclined towards 
“omnipotent (unconscious) clements;" and he would be inclined 
to agree that there is too much jargon, too many “high-falutin 
words,” and the danger of too many “parochial attitudes.” 

Some of these suggestions of adverse criticism have been frankly 
recognized by some of the writers and properly deprecated. Lack of 
scientific procedure is recognized and admitted in some of the 
but many of the statements are very carefully 
r scientific psychology and 
of psychotherapy get too 


presentations; 
worked out with due consideration fo 
for not letting theory and philosophy 
far out of hand. 

It would be difficult to find any book which would better put 
the reader in touch with the psychotherapy of 1956, or, should we 
say, with the psychotherapies of 1956. Two indexes are well made 


and put the reader in touch with most of the details of the book. 
A. S. EDWARDS 


The University of Georgia 


J. E. WarnLACE WALLIN. The Odyssey of a Psychologist. Lyndalia, 


Delaware: Author. 1955. Pp. 243. 
The author, who has more or less restrained himself in more than 


three hundred publications (books, journal articles, reviews and 
news articles), now presents Y hat he wants to say about the efforts 
of half a century in the interest of handicapped children, The 
book is dedicated to the very many who “rendered the Author 
Unflagging Support and Encouragement In the Implementation of 
Services for Children Handicapped in the Race for Life by Acci- 
dent, Disease, or Inheritance.” 

Tt is an autobiography and has two special concerns: first, “to set 
forth in some detail the nature of the factors that were influential in 
the making of a special educationist and clinical psychologist—in 
fact, that proved to be indispensable for the accomplishment of 
this purpose; second, to provide some insight into the intricate and 
complex problems of personal and professional adjustment in the 
field of education that may retard the progress of the work, or even 
derail it—not because of any flaw in the work itself but because of 
human weakness—and lend to waste of public funds and blighted 


hopes and disappointments.” 
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The report includes personal experience in and knowledge about 
31 training schools, colleges, and universities in 18 states. It 
covers also most of the period of the development of the work to 
which he dedicated his life and to which he probably gave more 
of pioneering efforts than has any other person. It is the story of 
his “pioneering experiences in special education, clinical psy- 
chology, and mental hygiene.” 

More or less chronologically, Wallin recounts his experiences and 
observations in schools and colleges; successes and failures; sources 
of encouragement and of discouragement; obstacles, both personal 
and impersonal; the defense of vested rights; progress and collapse 
of programs; difficulties in getting bills passed in legislatures; the 
folly of some psychologists; administrative fairness and unfairness; 
loyal support and strong opposition; untrained psychologists; 
personal views, both favorable and unfavorable of individuals; 
attitudes of M.D.s and the improving relations between psychia- 
trists and clinical psychologists; some of the ills of schools, of 
extension work, and of publication. 

The Odyssey is interesting as a story; it is significant as giving 
much of the history of the development of clinical psychology and 
of special classes for handicapped children. Many details found in 
this volume have probably never been printed before. 

The desire of Wallin was not only to give accurate information, 
but to render assistance to the curing of ills that still continue: 
“Little hope exists of removing unwholesome conditions that ham- 
per the efficient functioning of public institutions unless the 
conditions are frankly revealed to the professions concerned and to 
the public.” And he adds: “Fortunately, much in the recital 
redounds to the credit, honor, and glory of those who have contrib- 
uted of their might to pioneering in the specific areas in which I 
have labored.” 

This is a significant and interesting supplement to the history 
of psychology, especially in connection with its applications to 
human improvement. It appears at a time when the work to which 
he dedicated his life is receiving greatly increased interest, effort, 
and substantial support. 

The book includes almost a complete bibliography of Wallin’s 
publications (Pp. 217-243). 

A. S. EDWARDS 

The University of Georgia 
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Grecory A. Kimpur. Principles of General Psychology. New 

York: Ronald Press, 1956. Pp. 400. 
This appears to be a very sincere and conscientious effort to 
present beginning psychology, with great emphasis upon behavior 
hology is defined 


and the extremely objective point of view. Psye 
as the “science of behavior.” It deals only with “the aspects of 
behavior that are observable." Experience, mind, and consciousness 
are excluded from the definition, but introspective reports are 
acceptable. “It is not the subject's private experience which 
counts. It is, rather, the report of such experiences which is open 
to public inspection, verifiable, and testable. Reports of experience 
made by introspective psychologists are quite as acceptable as any 


other scientific records.” 
The presentation is in four 
ing general orientation, psycholog 
statistical methods, intelligence and aptitude testing, and funetions 
of intelligence; 2. Sensation and Perception: sensory processes in 
general, vision and audition, and perception; 3. The Modification 
of Behavior: psychological development, conditioning and simple 
arning, verbal and motor learning, extinction, 
; 4. Behavior dynamies: the primary 
stress and conflict, 


parts: 1. Basie Methodology, includ- 
y as a science, graphic and 


learning, selective le 
habit breaking, and forgetting 
drives, secondary motivation, frustration, 
personality and adjustment, and behavior pathology. 

The author at the outset attempts to clarify the meaning and 
work of psychology as à science and to correct common misconcep- 
ibout psychology. The book proceeds to deal with psychology 
e science, emphasizing the S-R and R-R 
Emphasis is placed upon understanding 
t, and of the refinement of 


tions: 
strictly as an objectiv 
formulae but not S-O-R. i 
the place of theory, and of experimen 


al control in real science. 
ition of graphical methods and a 


hich the median is unfortunately 
defined as “the middle score in a distribution.” 
ical faets and principles pays 


The presentation of psychologi 
thorough respect to experimental work and admits the S-P-R para- 


digm in connection with perception. At least the word "experience" 
leaks in for, *Perception refers to more complex experience in 
which faetors such as learning and motivation often play an im- 
portant role." Of special note are the 211 well-selected illustrations 


experiment 
There is an appropriate expos 
brief statement of statistics m W 
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which not only help in clarification but add much of interest to 
the book, which in general is simply and clearly written. 

The student will be interested in the discussions of behavior 
dynamics and of mental disorders. The author has attempted to 
give a careful evaluation of Freud’s psychoanalysis. But it will be 
hard for many to classify him with Copernicus and Darwin, and 
impossible for more to agree that his psychoanalysis was “The 
forerunner of all of the psychotherapies.” There was considerable 
psychology of motivation before Freud. 

The book is to be praised for its many excellent presentations, 
discussions and warnings to the student. Its double columns, many 
illustrations, and emphasis upon psychology as an objective science 
will appeal to many. It gives a well-selected collection for the 
beginning student who should read some other references for 
somewhat different treatment of some subjects. Each chapter gives 
a few suggestions for further reading and a longer list of references. 
It contains some most recent material not found in many texts. A 
subject index and a name index conclude the book. 


A. S. EDWARDS 
The University of Georgia 


GroncE E. GARDNER, (Ed). Case Studies in Childhood Emotional 
Disabilities, Vol. 2. New York: American Orthopsychiatric 
Association, 1956. Pp. 368. 

The foreword of this volume (Vol. 2) indicates the continued 
hopes that were present in the presentation of the first volume in 
1953. These were that the detailed discussions of cases “by selected 
Association members in the three fields of Child Psychiatry, 
Clinical Psychology and Psychiatrie Social Work" would provide 
valuable teaching material for these and allied fields. The success 
of the first volume, requiring a second printing, has led to the 
publication of the present one. 

The cases presented include collaborative treatment of a mother 
and son; planned return of child to own family; defense mechanisms 
of a six-year old boy; an acute separation problem; pupils psy- 
chologically absent from school; encopresis; the anal phase; 
brother identification in an adolescent girl; a young delinquent 
with psychotic mechanisms; ego treatment; a girl with atypical 
development; the congenitally blind; a hyperactive, destructive 
boy; adolescent delinquents; and a borderline psychotic child. 
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The degree of detail and thoroughness of study and report is 
at least suggested by the fact that there are only 15 reports in & 
368 page book. The shortest report is 10 pages and the longest 43 
pages in length, the average being a little more than 24. The amount 
of work in preparing these reports is more than matched by the 
expenditure of time in therapy (e. g., 200 visits by one boy and 
125 by his mother; treatment for more than 4 years). After such 
intensive work and detailed reporting, the reader may come to 
feel that he is quite well acquainted with at least some of the cases. 

Whether or not the reader is favorably inclined towards the 
s and doctrines underlying the therapy, he may 
well be impressed by the skill of the therapists, the long continued 
and persistent efforts to give much needed help, the expensiveness 


of the program, and the value of the results when they are finally 
attained. None can study this volume without gaining valuable 
insight into the skills and techniques of selected" workers in the 


hypotheses, theorie 


field. 
In general, the reports are simply and clearly written, and, aside 
from underlying theory, the student should have no trouble in 


understanding what he reads. 
al with drug therapy, but considering its 


These reports do not dea 
value and the success in teaching of the first volume, the reviewer 
cannot help but hope that such a book may appear, which, in 


similar fashion, will report case studies of the combined use of 
modern drug therapy and psychotherapy. Perhaps this book is 
the forerunner of such a volume. If drug therapy cuts time and 
expense, we may still be forced to agree with one of the discussants 
who remarked on a report of “four years of work with the mother 
and five years with William": “Possibly the greatest single con- 
tribution of research projects of this nature will be to prove to us 
that intensive treatment of very disturbed personalities is justi- 


fied.” 
Page 368 gives the table of contents of Volume 1. 
A. S. Epwarps 


The University of Georgia 


Psychological Diagnosis and Counseling in 


k: Henry Holt, 1956. Pp. 401. 
tten in the belief that the personnel 


ess of where and how he may wish to 


STANLEY S. MARZOLF. 
the Schools. New Yor : 
“This text has been WII 
worker in education, regardl 
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specialize, should have a single, basic orientation." Emphasis is 
placed upon guidance. “Inherent in the conception of education as 
guidance is the recognition of a need for procedures for working 
with individuals personally, often privately, and with a high re- 
gard for those rights of individuality which characterize a free 
society." 

Part 1 deals with the foundations of diagnosis and counseling: 
the sociological setting for individual methods, the evaluation of 
behavior and the theory of diagnosis; Part 2 takes up the case 
study: pertinent information, direct observation and school records, 
interviewing and physical measurements; Part 3 deals with testing: 
the characteristies of tests, administering tests, discussions of dif- 
ferent kinds of tests, remedial teaching, counseling in theory and 
in action, recording and reporting. 

In general, the book gives a survey of the work and duties of the 
personnel worker in schools, especially of the school psychologist. 
The problems are dealt with extensively rather than intensively, 
and may, in some respects, seem to overlap too much a course in 
testing. 'The discussion of testing is more extensive than that on 
counseling and therapy, and many readers will wish that more 
space had been given to the latter subjects. 

A list of tests indicating “landmarks in the development of 
testing" is given on page 194. The Stanford-Binet revision (1916) 
is included but not the T'erman-Merrill revision (1937). The latter 
is mentioned on page 241. Some of the subjects do not seem to be 
easily found in the index. 

The sweep of subject, matter seems to be rather broader than 
the title implies and appears to have erowded out space that might 
be expected to be given to actual diagnosis and counseling. 

On the other hand, the wide sweep and comprehensiveness may 
do exactly what the author wished, namely, to orient the student 
in the field as à whole and the problems that are involved. Much 
interesting and valuable information is given. There are 17 figures 
and 21 tables. Many valuable suggestions are given in relation to 
sources of error in the work of the counsclor and warnings against 
pitfalls and lack of proper preparation and training for the work. 

The text concludes with a name index and a subject index. 


A. 8. EDWARDS 
The University of Georgia 


— 
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Gress Myers BLAIR. Diagnostic and Remedial Teaching. New 
York: Macmillan, 1956. Pp. 409. 
The title of this book may raise the question, What is diagnostic 
teaching? The book is better than the title and deals with diagnostic 
testing and remedial teaching. It includes certain important sub- 
jeets of special interest to teachers of elementary schools, but which 
ed upon teachers at higher levels. 
deals with how to find out what the trou- 
are the causes of reading or 
he best techniques for 


are also fore 
The text very properly 
ble is before trying to remedy it. What 
of other disability, and then what are t 
remedial treatment? 
“Remedial teaching” in this book “refers to the two following 
types of activities: 1. Eliminating ineffective habits and unwhole- 
some attitudes, and reteaching skills which have been incorrectly 
learned (remedying defects). 2. Teaching for the first time those 
habits, skills, and attitudes which have never been learned but 
should have been, and which are needed by pupils (developing 
increased competence).” 
Part 1 considers diagnos 
the nature and results of remedial programs, 
reading disability, techniques for remedial treatment, remedial 
programs in elementary and high schools, junior and senior, read- 
rials and practice exercises. Part 2 deals with arithmetic, 
dwriting, and fundamentals of English. Part 3 dis- 
ing skill in remedial teaching: making case studies 


medial teaching. 


is and remedial teaching of reading: 
determining causes of 


ing mate 
spelling, han 
cusses improv 
and preparing for re 
The text is enriched by 25 Tables, 
This is a valuable and timely book in a day when increased de- 


mand is being made for both diagnosis and remedial teaching. The 
importance of the book can be better understood when it is known 
that children are still being sent to clinics; for example, when a 
boy 1314 years old, is sent to the clinic with the question, “Why 
can’t he learn to read?” an examination shows that his vision is so 
defective that he cannot recognize & word on a printed page. 
Teachers who are trained in the materials of this text should be 
able to detect and to diagnose many cases before the children 
reach the age of 13 1g years and will learn much of remedial work 


which is not sufficiently well known at the present time. 
Generally good advice and recommendations are given; class- 


20 Figures, and 16 Plates. 
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room teachers can give many tests, but the reviewer must object 
to the suggestion that diligent study of Measuring T ntelligence by 
Terman and Merrill is a substitute for proper training in the use of 
any of the Binet revisions. Such examinations require thoroughly 
well-trained examiners, both to give the examination and to 
interpret results. 

Many references are at the bottoms of pages, and useful bibliog- 
raphies are at the ends of chapters. A well-made index (Pp. 395- 
409) completes the book. Much valuable material has been gathered 
together in this volume. 

A. 8. EDWARDS 

The University of Georgia 
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LISTENING: A CASE OF TERMINOLOGICAL 
CONFUSION' 


Epx4 Lue FURNESS 
University of Wyoming 


The effect of the auditory mode of presentation on learning and 
memory has been investigated in various ways since 1894. One 
problem which existed in the past and which still exists is that 
of terminology. Is the appereeption of the spoken symbol a matter 
or comprehending aurally? 


of hearing, of listening, or of “auding’ 
Another problem is that of the processes involved in the au ral re- 
ception of the spoken symbol. The purpose of this paper is to show 
the evolution of terminology over the years and to make a differen- 
tiation of the processes involved in hearing, listening, and “aud- 
ing.” 

At the turn of the century arose a very promising growth of 
monographie literature devoted solely to the psychophysics oi 
language as such and devoted also to the conscious process 1N- 


volved in the appereeption of spoken symbols. Among those stud- 
gations of aural 


ies is one which still remains one of the basic investig l 
comprehension. It is a study made by the distinguished American 


educator, William C. Bagley. 

In analyzing the apperception of the spoken sentence, Bagley 
(3) considers symbols apart from their meaning and looks at them 
simply as different forms of stimuli appealing to one or another of 
the sense departments. He shows that certain psychophysical prin- 
ciples condition the efficiency of perception of symbols. In the 
typical auditory symbol, the differences in the expressive stimuli 
are qualitative, temporal, and intensive. In more concrete terms, 
the symbolic elements differ in that they are either consonants or 
Vowels, Consonants are distinguished by modal differences of com- 
Plex nature; they are the result of different forms of adjustment 
of the various parts of the voeal apparatus. Vowels are distin- 
Suished by simpler qualitat: 


ive differenees which result from the 
Modifications of the laryngeal clangs. These clangs are brought 


— 

* This article is part of a stud 
rogram, which was made possible by 
Vyoming Graduate School Council. 
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about by the changes in the form of the pharynx and the buccal 
cavity. Temporal differences are in the rapidity with which one 
symbolic element succeeds another symbolic element. Intensive 
differences are due to the fact that certain symbol elements may 
be emphasized and that some other symbol elements may be 
weakened. 

Bagley continues his explanation by stating that the perception 
of certain letters is sufficient for the production of the verbal 
(auditory-kinaesthetic) idea, and for the purposes of conceptual 
“apperception.” He answers the question concerning the character 
of the associative and recognitory processes that underlie the ap- 
perception of symbols by stating that, in general, the consciousness 
accompanying the apperception of auditory symbols is made up of 
sensational and affective elements. Some of these elements are 
aroused peripherally; others are aroused centrally in connections 
which vary in character with different individuals and under dif- 
ferent conditions. "These connections,” so Bagley says, "are ar- 
ranged in patterns which change rapidly into one another, and are 
in general transitory and fleeting" (3, p. 125). 

Bagley was concerned with the “what” of auditory language. 
Twenty-eight years later, Paul T. Rankin became interested in 
the quantification of oral expression among ordinary people. He 
found that the average adult spends 70% of the total waking time 
in some form of communication. Of that time a person spends 9% 
in writing activities, 16% in reading, 30% in speaking, and 45% in 
listening (12, p. 625ff.). 

In the 30 years since Rankin’s classic attempt to quantify the 
importance of auditory language, academic interest in the process 
of decoding spoken symbols has mounted steadily, but the serious 
question of what to call this form of linguistic behavior still con- 
fronts us. Rankin himself was sensitive to the inadequacies of the 
term “listening,” and he used the word with apology and hesita- 
tion (5, p. 540). 

Because of dissatisfaction with the vagueness of the term "lis- 
tening,” writers have coined a striking array of locutions which 
attempt to denote something more than the mere direction of at- 
tention to sound. In use are such terms as: speech reception, 
thoughtful hearing, getting the meaning from heard words, listen- 
ing with understanding to spoken language, comprehending oral 
utterance, interpreting oral expression, inclining the ear cogni- 
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tively, hearing and understanding, listening as a language art, and 
aural comprehension (8, p. 234). 

We are relatively safe from confusion with the term “aural 
English” so long as we are reading or writing it, but because “oral” 
and “aural” are homonyms in most of the major American dia- 
lects, the safer locution in an oral-aural context is the synonymous 
term, auditory English. Its abbreviation, “auding,” has steadily 
gained currency in the literature of language education (5, p. 540). 

The one who first proposed the term “auding,” Don Brown, says 
that he has adopted it to avoid the ambiguities of such terms as 
“listening,” “hearing,” and “understanding,” as well as to elimi- 
nate the awkwardness of such phrases as “setting the meaning 
from heard words,” “listening with understanding to spoken lan- 
r “interpreting vocal expression.” As Brown uses the 
need for more than mere awareness of percep- 
finition of reading provides the key to 
a definition of “auding.” For example, reading may be defined as 
the gross process of looking at, recognizing, and interpreting writ- 
ten symbols; “auding” may be defined as the gross process of lis- 
tening to, recognizing, and interpreting spoken symbols (6, p. 128). 
The terms, then, which are most commonly used to indicate the 
appereeption of the spoken language Or auditory language are 
hearing, listening, *auding" or comprehending aurally. In the re- 
mainder of this paper the author will attempt to make a differen- 
tiation of some of the functions of these three processes. 

Granted, the teaching of listening continues to be i the edge 
of the language arts problem, and it is still gomito i 7" 
that hearing and listening are identical. ‘Unfortunate y, the dic- 
tionary does not offer a great deal of help in this matter of termin- 


. [17 7 » *to 
ology. Webster says that to listen means “to give ear, 
hearken,” “to give heed,” “tq hear with attention,” and that to 


s reeiving sound.” 
hear means “to have the sense OT faculty of perceiving 
“hear does not necessarily imply, as 


Webster ‘ther that to D 
Er [s pisos and listen always do, vie 
application; e.g, one may hear without BREMEN or mey isten 
without hearing” (11, P- 84). Listening compre € Er 
sumes the ability to hear, but hearing does not o E he a Br 
to listen. Hearing and listening are not synonymous. he m ical 
doctor's concern is with hearing; the teacher's concern is with lis- 


tening (7, p. 140). 


guage," o 
term, he stresses the 
tion of sound. For him, a de 
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The words “apprehend” and “comprehend” serve rather well to 
distinguish between hearing and listening in the sense sought in 
the foregoing paragraph. Remembering that apprehend means “to 
become aware of through the senses” and that comprehend means 
“to embrace or understand a thing in all its compass and extent,” 
perhaps one could say that hearing is the apprehension of sound 
and that listening is the comprehension of aural symbols (11, p. 
84). 

Obviously, listening is more than hearing (4, p. 140). It in- 
volves following attentively the thread of a conversation, the de- 
velopment of an idea, the points of an argument. Like reading, it 
requires comprehension in terms of past experience of the listener 
and involves critical examination of what is heard. Whenever 
attention wanders, a portion of what is being presented is lost (10, 
pi 77). 

Next we may consider listening and its relation to *auding." In 
a highly compressed 17-page chapter on “Auding” in the latest of 
the triennial “Language Arts” issues of the Review of Educational 
Research, Caffrey defines auding as “the process of hearing, lis- 
tening to, recognizing, and interpreting or comprehending spoken 
language" (9, p. 121). The author goes on to illustrate the distine- 
tion between the first two phases, hearing and listening. He cites 
case studies in which patients exhibit ability to recognize the 
sounds of running water, erumpling paper, bells, or footsteps but 
fail to comprehend a word of their native tongue, or respond to 
their own names (8, p. 288). However, “many language specialists 
prefer to regard hearing and listening, like consciousness, as neces- 
sary coxprrioNs rather than as AspECTS of the auding process, and 
restrict their concept of auding to recognizing and interpreting 
hearing language” (5, p. 541). 

Poetry, a cappella choirs, and auto mechanics make special de- 
mands upon the ear, but, as Brown says, the relation between lis- 
tening skill (control of attention to sounds) and “auding” ability 
(competence in decoding spoken symbols) is far from clear (5, p- 
541). That is according to Brown. We may look to the research of 
other investigators. We find that of Frederick B. Agard and Harold 
B. Dunkel. Their research for the project called the Investigation 
of the Teaching of a Second Language, sponsored at the University 
of Chicago, 1944-1947, was used in the United States Army’s lan- 
guage training programs. In these programs, as is well known, the 
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approach was predominantly oral and aural. Agard and Dunkel 
say: “The probability that the determining letters evoke the 
phonetic letter-sound images corresponding to them, and that 
these in turn evoke the complete verbal (auditory-kinaesthetic) 
is follows: the ability to hear 
(2, p. 32). In other words, 


images has been stated paradoxically 
more depends on the need to hear les 
aural comprehension is dependent to a great degree upon supple- 
mentation, Agard and Dunkel's words are (2, p. 33): “We under- 
gaps for ourselves 
without being aware of so doing. ... It is this ability to supple- 
ment whieh makes the difference between auditory groping and 


stand more than we hear because we fill in the 


aural comprehension." 
In another work, Agard has noted that adequate “reception” or 
constitutes one-half of the act of speech com- 


“comprehension” 
omposed of at least three elements (1, p. 292): 


munieation and is e 


soken utterance and transmission of 


1. Auditory apprehension of the sj 
the purpose of physical or linguistic 


the acoustic images to the brain for 
response, 

2. Knowledge of the meaning to be attached to the acoustic image. 

3. Awareness of certain facts or assumptions bearing on the subject of 


discourse but not uttered in the actual speech situations. 


The evidence seems to indicate that the term “listening,” as it is 
commonly employed today, is inadequate. The evidence scems to 
indicate, too, that comprehending aurally or "auding" is the more 
adequate term; and that “nuding” consists of at least six processes: 
(a) hearing, (b) listening, (c) recognizing spoken language, (d) in- 
terpreting oral symbols, (e) supplementing meaning and knowl- 
edge of the symbols, and (f) being aware of facts or assumptions 


not uttered. 
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SOME RELATIONSHIPS BETWEEN SOCIO- 
METRIC POSITION AND LANGUAGE 
STRUCTURE OF YOUNG 
CHILDREN’ 


Fur» ROSENTHAL 
University of California, Berkeley 


The scientific study of language has been receiving increasing 
attention from psychologists and educators as they have become 
more conscious of the pervasive nature of language in human be- 
havior. In the present investigation, a small aspect of the general 
field is isolated for study. It is postulated that a relationship exists 
between an individual's language behavior and his position within 
a group. More specifically, this study is concerned with the problem 
of whether any significant differences can be observed in the spoken 
language used by children of high and low sociometric position. 

A survey of the literature in this field suggests that a relation- 
ship between language and sociometrie position can be demon- 
strated. This is derived from the following conclusions of various 


writers: 
1. Spoken language as a social phenomenon has been related 2 pet 
17 s 
*, y 


aspects of personality including social adjustment (1, 2, 4, 
2. There is some indication that individuals with poor language tend to 
sonal-social adjustment (11, 16, 18). 


be at the lower end of the scale in per I 
3. Social behavior as measured by sociometric methods seems to be re- 
lated to other aspects of personality (5, 8, 10, 14). . . 
4. There seems to be a tendency for the individuals of low sociometric 
Status to possess the less desirable personality traits and to be more emo- 


tionally disturbed than those individuals who have higher sociometrie status 


(5, 18, 15, 19). 
General Procedure 


Samples of spoken language from a group of second-grade chil- 
dren frequently chosen on à sociometrie questionnaire were con- 
trasted with the language samples of isolated,” rarely-chosen 


1 This research was carried out under the direction of Drs. Mary Cover 
Jones, Harold D. Carter, and Read D. Tuddenham of the University of 
California as partial fulfillment of the requirements for the Ph.D. degree. 
The reference to the dissertation appears 1n the bibliography for those read- 
ers who desire a fuller treatment of this problem. 
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children. For the language record a relatively free interview situ- 
ation with an adult was employed, and each child was encouraged 
to talk as much as he wished. 


The Interview? 


The interviews were in two paris—in the first, six open-ended 
questions were asked; in the second, the child was shown six pic- 
tures and asked to tell a story about cach. Each interview lasted 
from 15 to 30 minutes. It was recorded on a Gray Audograph ma- 
chine and later transcribed to typed protocols. 


Sociometric Analysis 


An analysis of sociometrie position was carried out on 11 second- 
grade classes (358 children) in six elementary schools in Oakland 
and Berkeley, California.” Each child was assessed in terms of his 
own class; the number of children in these classes varied from 26 
to 37 (mean = 33). 

In an individual interview, each child w sked to name three 
classmates with whom he would like to share each of four particu- 
lar activities. In treating the data, all the votes a child received 
from his classmates were totaled so as to provide each S with a 
single score. A method developed by Bronfenbrenner (3) was 
then used to convert the raw scores to T' scores for each class. These 
T scores can be considered comparable for the total group by 
evaluating them in terms of the probable number of choices which 
would occur by chance in a group (class) of a given size, Obtaining 
many more choices than would be expected by chance, given a 
class of a certain size, indicates high sociometrie status; obtaining 
fewer choices than would be expected by chance indicates low so- 
ciometric status. 

Children who received fewer choices in their respective classes 
than would be expeeted by chance (3% or less of the time) were 
placed in the “low sociometrie" group; those children who received 
more choices than would be expected by chance (97% or more of 
the time) were placed in the “high sociometrie” group. 


* These questions and instructions are from material made available to 
the present writer by Dr. Walter Loban of the University of California who 
developed the interviews. 

*'The same schools from which the language records were obtained. 
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Sample 

The “high” group contained 10 boys and 10 girls, and the "low" 
group contained 9 girls and 11 boys. In the “high” group, the ages 
in months ranged from 90 to 97 with the mean equal to 94 months 
and the standard deviation equal to 2 months; the figures for the 
“Jow” group were the same. The Kulilman-Anderson IQ scores 
ranged from 92 to 112 in the “low” group with the mean and stand- 
on equal to 103 and 6, respeetively. In the “high” group, 
92 to 114. with the mean and standard devia- 
ely. The difference between the 


ard deviati 
the TQ range was from 
tion equal to 104 and 6, respectiv 
two groups on IQ was not statistically significant. The variable of 


socioeconomie status (in terms of father's occupation) was con- 
trolled by having equal representation of children of various status 
levels in each group. 

Development of Language Measures 
A General Description of Language 


ition, language was evaluated in terms of three 


In this investig: 
Structure, Method, and Communicative 


major characteristics: 
Value. 

Structure. The structural e 
(phonemes) and the connections of 
ete.) —the building blocks out of whic 


lements of speech are the basic sounds 
sounds (morphenes, words, 
h all languages derive their 
ally, this category involves a description of the 
al uses in his attempt at expression. 
ar for all persons of a given 


form. More specifie 
materials which an individu 
These structural elements are simil 
language group such as English, German, French, ete. : 

Method. The second general category—the methodological 
| the materials are used by a speaker in 


in style, word choice, rate of speaking, 
thodological characteristics of 


concerns the way in whiel 
communication. Differences 
voice quality are all examples of me 
speech. 

Communicative Value. 
methodological. Furthermore, Co! 


This category is closely related to the 
nmunicative value can be thought 
of as characteristic of à purposive aspect in language. In other 
Words, as soon a8 verbal behavior is subjected to some nonrandom 
processes or methods, the language can be said to have some func- 
tion—a function that is the direct expression of the rationale be- 
hind the particular methodological cha racteristies Which are being 
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utilized. Thus, if the aim is communication of information, the 
structure must be handled according to methodological considera- 
tions which have been agreed upon by the sender and recipient. 
For example, when two English speaking individuals wish to ex- 
change information, the sounds, words, sentences, ete., can not 
be employed in a random fashon but must follow rules which 
have been agreed upon or accepted (either covertly or overtly) 
by both if any meaning is to be transmitted. 


An Operational Description of Language 


In choosing the actual characteristies of language which were 
to be used in comparing the records of the children, two consider- 
ations were involved. Those aspects of speech were chosen which 
could be measured and evaluated statistically; and an attempt 
was made to retain meaningful, not merely measurable, features 
in terms of the three general characteristies discussed above. The 
following characteristies of language were used in the analysis: 


Total Number of Words. 


Communication Units. 'This aspect of language focuses atten- 
tion on the meaningfulness, the communicative value of a message. 
Loban* in a study of children's speech has developed and used 
the concept of communication units: 


A word or group of words conveying a meaningful idea and having a quality 
of completeness about it is a communication unit. (For instance), ^I know à 
boy with red hair.” If you leave out “with red hair" you have lost the essen- 
tial meaning of that sentence. “I know a boy" does not fnean the:same thing 
as “I know a boy with red hair.” On the other hand, “yes” can be a unit of 
communication when it is an answer to “Are you going?” 


Four numerical scores were derived from this category: 

1. The total number communication units (CU) in a given 
sample of language. 

2. The proportion of words used in CU to the total number of 
words used in a given sample of language. This proportion was 
defined as follows: 


— total of all words used in CU 
cv ' total number of words in sample 


*'This and the following quotes are from an unpublished study by Dr. 


Walter Loban of the University of California. 
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3. The mean of the number of words used in CU. This was 
defined as follows: 


_ total of all words in CU 
Meu = 
total number of CU (n) 


4. The standard deviation of the mean number of words used 
as CU. This was defined as follows: 


XXX — Mov)? 


SDcv = 
n 


X = the number of words in one CU 


ation of how many CU's are given in a sample 
of language; No. 2 is an indication of how much of the total 
language sample is valuable as communication; No. 3 indicates 
the average length of the CU's used in a sample of language; 
and No. 4 indieates how much the individual tends to vary in 


the lengths of the CU that he uses. 

Maze. Mazes are a series of words conveying no meaning; an 
indication of “noise” in the message. Loban has also developed 
and discussed this concept giving some examples of mazes from 


children's speech. The mazes are in parentheses: 


No. 1 is an indic 


1. A short maze at the beginning of à communication unit and deeply re- 
lated to that communication unit: “(I'm goin) ...I'm going to beat him up 
and kick him in the nose.” . 

2. A short maze in the mi nication unit and deeply re- 
lated to that communication unit: fixin' ready t'go home, mah 
mother called me up in the house, have to) ah have to get 
mah hair combed." 

3. A long maze not i 
least not deeply related: 
snow time, he had to have 
...and thas all I think o 


ddle of the commu 
“When I was 
an’ (an’, an’, an 
mmediately related to the communication unit, at 

rogram last Sunday (an’ he, an’ 


«I saw a hunter P. 
lot hu, wah-na when he, uh, not too many dogs, he) 


f that picture." 
were derived in a similar 


merical scores 
e described in connection 


In this category four nu 
d anings to thos 


manner and with similar me 
with communication units: j ; 
1. The total number of mazes in a given sample of language. 
2. The proportion of words used in mazes to the total number 
of words used in a given sample of language. This was defined 


as 1 — Pov- 
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3. The mean number of words used in mazes. This was defined 
as follows: 


total of all words in mazes 
total number mazes (7) 


Mm 


4. The standard deviation of the mean number of words used 
as mazes. This wes defined similarly as above. 

Type-Token Ratio. Johnson has discussed this measure of lan- 
guage extensively and described the many forms in which it can 
be used. He defines the TTR as: 

...8 measure of vocabulary ‘flexibility’ or variability, designed to indicate 


certain aspects of language adequacy. It expresses the ratio of different words 
(types) to total words (tokens) in a given language sample (7, p. 1). 


Verb-Adjective Ratio. This measure is obtained by dividing the 
number of verbs (active construction) by the number of adjectives 
(passive construction) in a given sample of language (2). In 
this manner, a score of 100 indicates that the number of verbs 
and adjectives is equal; a score above 100 indicates that there 
are more verbs than adjectives in the particular language sample; 
and a score below 100 indicates that there are more adjectives 
than verbs. Generally, this ratio seems to be an indicator of how 
active and moving or how passive and inactive a sample of lan- 
guage tends to be (2, 4). 

The language measures described above can be related to the 
three characteristics of language which have been postulated by 
grouping them as follows: 

Structure. This category and its measures indicate the number 
and types of speech characteristics that these language samples 
contained. 

1. The total number of words. 

2. The total number of communication units. 

3. The total number of mazes. 

Method. This category gives a more qualitative description of 
language, and these measures indicate the variability, movement, 
complexity, and the richness of the sample of language. 

1. The mean number of words in the communication units. 

2. The mean number of words in the mazes. 

3. The standard deviation of words in the communication units. 

4. The standard deviation of words in the mazes. 


go 
eo 
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5. The type-token ratio. 

6. The verb-adjective ratio. 

Communicative Value. This category cons 
of the total number of words which are used as communication 
units (and or 1 — Per). This proportion indicates how much of 
the sample of language is meaningful communication and how 
much is “nonsense” from the point of view of a recipient. 


s of the proportion 


Analysis 


The Language Measures 

In order to obtain a relatively standard situation, the analysis 
of language records was limited to the responses to the six pictures. 
The exeeption to this was in the computation of the total number 
of words which was obtained by counting all the words spoken 
by the child during the interview. With respect to the other aspects 
of language, the first 200 words and the last 200 words given in re- 
sponse to the pictures (or smaller segments when a child gave 
less than 400 words in his stories to the pictures) were analysed 
seperately, and the correlation of the scores obtained on the two 
equal segments allowed some estimate of reliability (internal 
consistency) of scores. A total seore in cach category was obtained 
by averaging the two scores or by adding them when interest was 
in a total number, c.g, the total number communication units, 
of mazes, of words in communication units and in mazes (in that 
part of the protocol used for the analysis). In each case the raw 


numerical scores have been transformed to normalized T scores 


for ease of statistical manipulation by the procedure outlined by 


Guilford (6). 
The following ten aspects of language were drawn from the 


protocols of the 40 children for analysis. In the remainder of this 
paper they will be referred to by number: 
1. The total number of words. —— 
2. The total number of communication units (CU). 
3. The percentage of the total words used in the CU. 


4. The mean number of words used in the CU. 
5. The standard deviation of the mean number of words used 


in the CU. 


6. The verb-adjective ratio. d 
7. The mean number of words used in the mazes. 
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8. The total number of mazes. 

9. The standard deviation of the mean number of words used 
in the mazes. 

10. The type-token ratio. 

Statistical analysis of these ten language measures on the total 
group of 40 children indicated the following characteristics: 

1. The ten measures were converted to T' scores and yielded 
roughly equal means (51.7 to 55.4; X4 = 52.4) and standard 
deviations (9.53 to 11.93; Xsp — 10.63). 

2. There seemed to be some tendency for measures two to ten 
to be internally consistent in these records (excluding measures 
seven and nine) with Spearman-Brown reliabilities between two 
parts of the protocols ranging from .47 to .83 (see Table 1). 

3. The intercorrelations of the ten measures did not indicate 
any significant groupings or clusters, nor any excessive degrees 
of overlap between these measures on the present sample (see 
Table 1). This indicates that the ten language categories were 
probably tapping different aspects of language in the present 
sample. 


TABLE 1 


PEARSON INTERCORRELATIONS AND RELIABILITIES FOR THE TEN 
LANGUAGE MEASURES COMPUTED FROM T Scores 


(N = 40) 

poo 2 3 4 5 6 7 8 9 10 Ru 
1 —26 —17 55** —02 —07 —33* —31* 291  —76** 
2 08 15 24 19 24 24  —929 41** 74 
3 —06  —03 14 53**  76** —40** 9g 70 
4 49** —09 14  —23 09 -—09 59 
5 —22 -02 —04 00 17 47 
6 11 18  —13 36* 77 
v 20  —88** —45** 15 
8 09 29 83 
9 —25 43 


** Significance at the .01 level. 

* Significance at the .05 level. 

Note.—Reliability coefficients were obtained with the Spearman-Brown 
formula from correlations between the scores on the two segments of the 
protocols of nine of the language measures. 
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Analysis of Variance 

The next step in investigating the present problem was to ana- 
lyse the variance of the language measures in relation to the in- 
dependent variable—sociometrie status. In the present situation 
this was accomplished by using the three-way classification 
scheme presented by MeNemar as ease XVII (12, pp. 332-335) ; 
this model, with one modification, seemed to be the one that was 
most applicable. 

The breakdown of the total degrees of freedom was in terms 
of R rows (individuals), B blocks (sociometrie groups), and C 
columns (language variables). A modification was necessary be- 
cause the column effect has been taken out when the ten language 
means had been essentially equalized by the transformation to 
T scores; and therefore, the total degrees of freedom had ten 
restrictions, instead of one, induced on them. Furthermore, since 
the individuals from block to block (high and low sociometric 
status) are relatively independent, ie, even though the groups 
are matched for certain variables, there has been no matching 
by individuals—there was no R by C interaction, no Bby R 
interaction, and no triple interaction. Table 2 summarizes the 
analysis of variance. 

T ratios were also computed between the high and low socio- 
ach of the ten language measures in order to 
analysis of the language characteristics 
ble 3. 


metric groups for e 
allow a more intensive 
of the two groups. This is summarized in Ta 


TABLE 2 
ANALYSIS OF VARIANCE 


Source af s? F 2 
2 
Individuals (Rows) | (RB — B) 38 ce (si) | 18.1 s2/s? |<.001 
Sociometric Groups | (B - D 1 2227.8 (sb) 
(Blocks) 


Language (Columns) | O 


B i L1) (C —1)9 | 472.8 (sbe) 
X C Interaction (B )( i M m 
Residual B(R — 1) (C — 1) 342 100.1 (ss) 


Total (RBC — 1) 390 
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TABLE 3 
STANDARD ERRORS or THE DIFFERENCES BETWEEN THE MEANS AND l Rarios 
EN THE Means or THe Hion asp Low SociomeTtwe 


GROUPS on THE TEN LANGUAGE MEASURES (df = 38) 
)guage 
asure SDM t p 
1 >.10 
2 <.01 > .001 
3 2 -01 
4 2 .02 
5 1 2.10 
6 4 «.001 
7 1.68 .10 
8 1 <.10 > .05 
9 2 «.05 > .02 
10 2 .02 


| 


Results 

An overall analysis of variances compared the ten measures of 
language simultancously between the high and low sociometric 
groups. The result of this analysis indicated a nonchance difference 
between the high and low groups. Furthermore, it was discovered 
that the profiles (B x C interaction) of means for these language 
measures were not the same for the two extreme sociometric 
groups. 

The significant B X C interaction in the present situation in- 
dicates that the ten language measures do not consistently differ 
from group to group in the same manner—that is, the high group 
is not superior to the low group in all measures. This interaction 
effect would lead to the expectations that: (a) not all the measures 
show a difference between the groups; (b) the differences obtained 
are not all of the same magnitude (equally significant); and (c) 
not all the measures differentiate these groups in the samo dirce- 
tion, i.e., the high sociometrie group having the "superior" lan- 
guage. In the discussion of the ¢ test analysis presented below, 
this interpretation of the B X C interaction is upheld. 

The ten language measures for the high and low sociometrie 
groups were then compared by ¢ ratios in order to obtain a more 
detailed pieture of the differenees in the two groups. This anal- 
ysis indicated: 

1. There was no significant difference between high and low 
sociometrie groups in the total number of words given. 
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2. The high sociometrie group gave a significantly smaller total 
number of communication units than the low sociometrie group. 

3. The high sociometrie group gave a significantly larger per- 
centage of words used as communication units than the low socio- 
metrie group. 

4. The high soeiometrie group gave a significantly larger mean 
number of words used as communication units than the low 
sociometrie group. 

5. There was no significant difference between the high and 
low sociometric groups in the size of standard deviation of the 
mean number of words used as communication units. 

6. The high sociometrie group had a significantly greater verb- 
adjective ratio (more verbs to adjcetives) than the low socio- 
metrie group. 

7. The high soeiometrie group had a slightly lower (significant 
at 1002) mean number of words used as mazes than the low 
sociometrie group. 

8. The high sociometrie group had a slightly smaller number 
of mazes than the low sociometrie group. 

9. The low sociometrie group had a signifie 


antly larger stan- 
dard deviation of the mean number of words used as mazes than 


the high sociometrie group. 

10. The high sociometrie group h 
token ratio (more different, words rel : 
of words) than the low sociometrie group." 


ad a significantly higher type- 
ative to the total number 


Discussion and Conclusions 


Keeping in mind the hazards of generalizing from this small- 
seale study, the results suggest certain tendencies in regard to 
E S ME x Lei a S is ge 
the language characteristies of these two groups of second-grade 


5 It was noted (in the original study and not described ag ed res 
two of the variances for these ten measures FIM he A ae vo 
groups are significant. Although this must be ag aa : a k — 
the variances taken as a group were not significantly di n a rim 
indicate the following possibilities: 1. The Bis MEO i pan om th 
vary more with respect to total number of — BRUS ipli = tion 
the low sociometrie group- 2- The high sociometri¢ inc " "e [p o vary 
more in their use of verbs relative to adjectives tiem e os —— 
group. This finding would tend to invalidate e - pied E — 
of the two groups on this measure, but since i oe rs 
is highly signifieant (.001) there is a strong possibility hat a a 


; er lev significance. 
ference remains, albeit at a lowe! level of signi 
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children. The children of high sociometrie status tend toward 
language characterized by a high percentage (of the total words 
used) of meaningful communication. Their communication is in 
longer units. They manifest shorter and fewer aspects of “noise” 
(mazes) in the language, a high usage of verbs relative to ad- 
jectives, and a greater variety of words. This group also seems 
to be somewhat more variable with respect to these aspects of 
language, especially in regard to the total number of words and 
the verb-adjective ratio. This latter is only suggested and could 
be an artifact of the tendency of this group to be higher on the 
language measures and, therefore, to have more opportunity for 
variation from zero. 

The children of low sociometrie status tend toward language 
charaeterized by a smaller percentage (of the total words used) 
of meaningful communication. Their communication is in shorter 
units. They use a greater number of "noise" (mazes) aspects, 
longer and more variable mazes, more adjectives relative to verbs, 
and less variety in the type of words. 

There were no differences found for the two groups with respect 
to the total number of words used and the relative variability in 
length of communication units. 

It is possible to state these conclusions in terms of the threefold 
analysis outlined earlier. Although it is not assumed that three 
independent aspects of language exist in terms of the measures 
used here, nevertheless it is meaningful to conceive of language 
in this way for purposes of exposition. 

Structure. The available evidence does not indicate that high 
sociometric status is related to the tendency to use a greater 
quantity of language. The differenees found with respect to the 
total number of communication units and the total number of 
mazes seem more related to the other aspects of language. In 
other words, since the groups do not differ in the total number 
of words used, the findings with respect to CU and mazes might 
be more indicative of qualitative (method and communicative 
value) than quantitative differences. 

Method. The language of children of high sociometric status 
seems to be more active and moving and more varied in terms 
of the types of words employed. The variability in lengths of 
the mazes seems to be greater in the low sociometrie group, but 
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this cola be an artifact as described above. Also, the variability 
ví the lengths of the communication units does not seem to differ 
for the high and low sociometric groups. Even though the in- 
dividuals in the “high” group tend to give longer communication 
units in general, they do not vary the length of these units more 
than the “low” group. 

Communicative value. With respect to the aspects of language 
which seem to be related to communicative value, the findings 
indicate that the children of high sociometric status are clearly 
superior to the children of low sociometric status. The language 
of “high” children is characterized by more communicative value 
than that of the “low” children. 

In general, there was à tendency for children of high sociometric 
status to be differentiated from children of low sociometric status 
in terms of language which is more active, variable (word usage), 
and communicative to another individual. It seems, therefore, 
that children of this age and of approximately "normal" IQ 
tended to use an equal number of words regardless of their soci- 
ometric status with their peers in the type of interview situation 
used in this study. However, the more “popular” children use 
language of superior quality and social usefulness. : 

It is interesting to speculate about these findings which suggest 
that sociometric position is positively related to the quality and 
social usefulness of spoken language. Without making any as- 
sumptions about cause and effect, the following tentative ex- 
planation” is advanced for the present results. The work of other 
investigators which has been mentioned seems to give some sup- 


port for these views. — . 
When an individual is inter: 
high status (being highly chosen 


frequent and more diverse contac 
an “isolated” individual. This in turn would make more com- 


munication channels available to the “popular” individual. mie 
the highly chosen individual will have more per cae yi w Eo 
he can send messages" and a higher probability that he wil 
receive more communication than à less popular person. f 
In addition, three major considerations seem to arise which 
favor the expectation that the verbal communication of a highly 
chosen, popular individual will be superior to that of an isolate. 


acting in a group 50 that he has 
), he will presumably have more 
4 with the other members than 
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1. The highly chosen individual has many channels open for 
outgoing and incoming communication. He has, therefore, many 
opportunities to make use of verbal communication. 

2. Since learning plays an important role in the development 
of verbal behavior, those individuals who have greater opportunity 
to send and receive “messages” (more practice) will presumably 
have more chances to learn successful verbal habits. 

3. The highly chosen individuals are also more likely to be 
rewarded for their attempts at communication by the interpersonal 
relations which result from this behavior. Also, active inter- 
communication between individuals will tend to encourage those 
habits which lead to meaningful transmission and reception—while 
discouraging those habits that would interfere with this task. 


Summary 


Twenty second-grade children of "high" sociometrie position 
were contrasted with twenty second-grade children of “low” socio- 
metric position on their language behavior in an interview with 
an adult. A conceptual schema of language was proposed in terms 
of structure, method, and communicative value; and the groups 
were contrasted on the basis of ten statistical aspects of language 
derived from this schema. 

An attempt was made to control IQ, sex, and age; the groups 
were compared by an analysis of variance and £ test techniques. 
The results showed significant differences between “high” and 
“Jow” groups on language behavior. 
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TRANSFER OF READING TRAINING FROM 
NONTECHNICAL TO TECHNICAL 
MATERIAL 


Marvin SCHWARTZ 
U. S. Naval School of Aviation Medicine, Pensacola, Florida* 


A number of recent studies have attempted to evaluate the 
effectiveness of reading improvement training given college level 
S's (e.g, 2). These studies have been principally concerned with 
demonstrating that reading training results in increased reading 
rate, increased comprehension, and changes in academic grades. 
While these results are important, they do not directly answer a 
typical criticism leveled at reading improvement courses. The 
science or engineering student or the executive may express con- 
cern that, while this training may improve his ability to read a 
newspaper or novel, it may actually interfere with his ability 
to read the technical material associated with his occupation. To 
state it another way: should the materials employed in reading 
training courses be geared to the course of study of the individual 
student, i.e., should philosophy majors be trained on philosophical 
reading and mathematicians on mathematical material? The pur- 
pose of the present study was to determine if reading training 
utilizing relatively fictionalized, nontechnical material would 
transfer positively to the reading of technical material. 

All beginning naval aviation students are exposed to a reading 
improvement course as part of their pre-flight training. It has 
been shown (3) that this training results in improved reading 
speed for the material utilized in training. However, since the 
purpose of pre-flight training is to prepare the student to be an 
efficient naval aviator, this reading training might better be omit- 
ted unless it can be demonstrated to aid the student in the reading 
of the technical material associated with his training and occupa- 
tion. 

Method 


Subjects 

The S's were three classes of students in the U. S. Naval School, 
Pre-flight. The classes contained 27, 30, and 52 students, Te 
pe RS 


1 The opinions and conclusions expressed are those of the writer and not, 
necessarily, those of the Department of the Navy. 
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spectively. Because of academic difficulties in other courses re- 
sulting in failure or setback, and drops for physical reasons, 17; 
17, and 40 students, respectively, completed the entire experi- 
mental procedure. Therefore, 17 from the group of 40 students 
were chosen, at random, to complete the experimental population. 
The general population represented by these three classes consists 
of cadets with a minimum of two years of college education. Each 
had been previously sereened with an intelhgence test, a bio- 
graphical inventory, and a spatial orientation test before being 


accepted for flight training. 


Reading Training 

There are approximately 17.5 hours available for reading train- 
ing during the pre-flight curriculum. Some seven of these hours 
are devoted to instruction in the methods and advantages of im- 
proving reading ability. The principal techniques utilized for this 
purpose are lectures and demonstrations with the Harvard series 
of reading training films. Some seven hours are devoted to actual 
reading practice, principally with the Reading Rate Controller. 
The material utilized for this practice varies from fiction and 
biography to difficult, but nontechnical, essay. Some examples are: 
A Son of the Middle Border, By Hamlin Garland; The Way of a 
Transgressor, by Negley Farson; and Modern Arms and Free 
Men, by Vannevar Bush. While a wide range of difficulty is rep- 
resented, none of the material can be characterized as being 
technical in substance. Previous work (3) demonstrated that train- 
ing on this type of material results in an average a. 
of 88% in reading speed for this material, and, after 10 wee 4 
of no further instruction, 90% of this improvement is retained. 
Comprehension, on the other hand, shows an average ewe m 
of approximately 9% with 5% of this decrement remaining alter 


10 weeks. 


Experimental Material lt. 
for the reading training 1s of a 


Since the material employed : 
nontechnical nature, the object of the present experiment was to 
utilize as test material relatively technical selections which, while 
similar to material encountered in pre-flight and subsequent 

familiar to the Ss. Although 


ground school training, would be unta deu ou 
they are given an intensive military indoctrination and training 


in the use of small arms, students passing through the pre-flight 
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curriculum are not given any instruction about naval weapons as 
employed in the surface fleet. Therefore, the material chosen for 
the present experiment consisted of three selections from a stand- 
ard text on that subject, Naval Weapons and their Uses, published 
by the United States Naval Institute. These were about ships’ 
armor, construction of naval guns, and explosives. They contained 
1800, 1700, and 1725 words, respectively. The material emphasized 
the engineering, physics, and chemistry employed in the construc- 
tion and operation of naval weapons, as well as technical nomen- 
clature. Comprehension tests were constructed to accompany cach 
selection. They contained a mixture of about one-half each of 
true-false and multiple choice items, 20 per test. 

Previous work had demonstrated that reading speed could be 
measured with these selections with a reliability ranging from 
.70 to .80. The reliability of the comprehension tests was some- 
what lower, .40 to .50, but of sufficient magnitude to assess group 
trends, as in the present experiment. 


Experimental Design 


Each class was tested three times: once on each of the three 
selections, the order of administration to each class being de- 
termined by a latin square drawn at random. The essential fea- 
tures of this design are such that it enables independent evalua- 
tions of the changes over sessions of the experiment, differences 
among the test materials, and differences among the three classes: 
(Differences due to order of administration of the tests were con- 
sidered to be negligible and are eonfounded with class differences 
in the present experiment—ef. [1, p. 273] for further explication 
of the design.) 

'The initial reading and comprehension tests were given while 
the students were in the first week of the pre-flight program. 
This is an indoctrination period and was approximately three 
days before the reading training was begun. The posttraining test 
was given on the day following completion of the reading training. 
The retention test was administered an average of 12 weeks fol- 
lowing the posttraining test. (The Christmas leave period caused 
this to vary among the classes. The minimum time between these 
tests was 10 weeks.) At the beginning of each test, the students 
were told simply that this was “a test of your ability to read 
and understand material of a technical nature—something you 
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will be expected to do throughout your respective naval careers.” 
All tests were administered in a different setting from the one in 
which the reading training was conducted. 


Results 

Table 1 presents the mean reading speed and comprehension 
scores over the course of the experiment. Reading speed for tech- 
nical material increased 104° between the initial and posttraining 
tests and showed a 92% improvement when the initial and re- 
tention tests were compared. Comprehension of the material read, 
a decrement. There was a 7.5% decrement com- 
aining tests and a 5% decrement com- 
esents the sum- 


however, showed 
paring the initial and posttr 
paring the initial and retention tests. Table 2 pr 
mary of the analysis of variance performed on the reading speed 
measure. 

The principal result is the finding of a highly significant source 
of variance attributable to differences among experimental ses- 
sions (F = 59.29: 2 and 96 df). Applying the ¢ test to the indi- 


TABLE 1 


Mean READING SPEED AND COMPREHENSION Scores 
— = E = = SE —_—__—_ aan 
d Speed Comprehension 
73.0% 


193 wpm 


Initial 

Posttraining 393 wpm prO 

I osttraining— Initial +101% PEG 

Retention 372 wpm 68.0% 
—5.0% 


+ 92% 


Retention—Initial 


TABLE 2 


NCE PERFORMED on READING SPEED 


Summary or ANALYSIS 
Source-of Variante df d i 
Between Subj i 
: Subjects: 1.59 - 
Sessions X tests (Classes) 2 ye : 
y Residual 48 gon 
Within Subj 
Subjects: a 2 
Between sessions 2 ns pes Eae 
Between tests 2 ap 1.72 = 
Sessions x tests 2 176.98 .72 
Residual 96 108:08 
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TABLE 3 
SUMMARY OF ANALYSIS OF VARIANCE PERFORMED ON CoMPREHENSION 


Source of Variance df Variance F p 


Between Subjects: 
Sessions X tests (classes) 2 30.88 3.32 «.05 
Residual 48 9.29 

Within Subjects: 


Between sessions 2 30.77 5 <.01 
Between tests 2 5.06 4.1 <.05 
Session X tests 2 2.53 — = 
Residual 96 3.64 


vidual means indicated that the difference between the initial and 
posttraining sessions was significant beyond the .01 level (t = 
8.93; 96 df) while the difference between the posttraining and the 
retention sessions was not significant (t = 1.05; 96 df). In ad- 
dition, the test concerned with ships’ armor was somewhat more 
difficult to read than the other two (t = 2.05; 96 df; p < .05). 

Table 3 presents the summary of the analysis of variance per- 
formed on the comprehension scores. Again, the principal result 
is a highly significant source of variance attributable to differences 
among experimental sessions (F = 8.45; 2 and 96 df). Again, t 
tests showed that the difference between the initial and posttrain- 
ing sessions was significant beyond the .01 level (t = 4.06; 96 df) 
while the difference between the posttraining and retention ses- 
sions was not significant (£ = 1.41; 96 df). In addition, the selec- 
tion about gun construction showed a significantly higher per- 
centage of comprehension than did the other two tests (t = 2.49; 
96 df; p < .05). 

Discussion 


The present experiment demonstrated that increased reading 
speed, resulting from a course of training in reading improve- 
ment, transfered to the reading of material of a different type 
from that employed during the training. After being trained on 
relatively light, nontechnical materials, the students in the pres- 
ent experiment demonstrated over 9095 improvement in reading 
speed for highly technieal material. For present purposes, the 
evaluation of the reading training in terms of the source of im- 
provement is irrelevant. However, we can briefly state what it 
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probably was not. It appears unlikely that the improvement is 
increased motivation, i.e., simply giving 
the students a pep-talk on the desirability of reading faster would 
not have resulted in the degree of improvement demonstrated. 
Further, since the training is a required part of the pre-flight cur- 
riculum for all students, the increase can not be ascribed to the 
fact that motivated students secking reading instruction would 
benefit from any course of instruction. More relevantly, whatever 
the source of the improvement, the fact that it transfered indicates 
that the course had the desired effect and that the training has 
benefits beyond the material employed during training. Although 
a completely definitive test of the present problem would involve 
reading training on technical material, testing on technical ma- 
terial, and a comparison with the present program, the high read- 
ing speed transfer found in this study would suggest that it is 
not necessary to train readers of technical material utilizing ma- 


terials from their specialties. 

The finding that comprehension of the material read decreased 
with increased reading speed bears comment. It should be noted 
that this decrement is probably not a result of negative transfer 
from the training on nontechnical material, inasmuch as & similar 
decrement, of the same order of magnitude, was found for the 
nontechnical material after training (3). Thus, the decrement 
would appear to be due to increased speed of reading a 
has become axiomatic in this area to say that the fast reader 
understands and retains more than the slow reader. The present 
findings would indicate that this is probably only partially true. 
It may be true in the sense that (a) the faster reader covers à 
greater amount of material, (b) less time elapses from the be- 
ginning to the end of any given selection, facillitating the pac 
tion of the material, and (c) the initially faster reader e also 
be the more intelligent reader. However, the present i 2 n 
dicate that speed, per se, has à deleterious rather ber a man 
effect, Nevertheless, the magnitude of the derem — a 
in the present instance is & small price to pay for the seit ica 
enormous increase in reading speed. Also, this je T E > 
ably be overcome by stressing that the student : ou oo a 
crease his speed beyond the limits of his ability to comprehe 


the material. 


completely ascribable to 
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Summary 


The present experiment demonstrated that reading training 
utilizing nontechnical reading materials for training transfered 
to the reading of technical material. College level Ss demonstrated 
an average of 10457 improvement in reading speed for technical 
material after training on nontechnical material. After 10 weeks, 
90% of the improvement was retained. Accompanying this in- 
crease was an average decrement of 5-755 in comprehension of 
the material read. It was concluded that it is not necessary to train 
readers of technical material utilizing materials from their special- 
ties. 
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LEARNING ABILITY AND RETENTION 
AFTER LONG PERIODS' 


D. A. Worcester 
University of Wisconsin 


The writer, in the summer of 1915, learned 20 100-word selec- 
tions from the prose writing of Matthew Arnold and Thomas 
Huxley. Twelve of these were learned by the auditory method 
1e, they were read aloud to him until they could be recited amd 
nine were studied visually until a recitation could be made. The 
method allowed silent recitations during the learning process. The 
learner determined the rate of presentation which might vary 
during the course of learning. An accuracy of at least 95% was 
required. The time was the total elapsed time from the beginning 
required degree of accuracy was achieved. 


of learning until the 1 
int of one successful repeti- 


Altho the goal was to learn to the po 
tion, the above method might result in a higher degree of learning 
if there were an underestimate of the degree of memorization. One 
of the selections is here reproduced—one from Arnold. 

aslator of Homer will have, or he cannot succeed, 
1 that disinterested love of it, which are, 
, and so precious; he will not be led off 
for the real matter, and, where he 
nt will be too slight for 
l turn him aside—he will 


m 
The successful future trar 
in true sense for his subject, anc 
0th of them, so rare in literature 


he will have an eye 
of this, no hi 


fections wil 
t with his own. 


by any false scent; 
ee he may find any indication 
iim, no shade will be too fine, no imper 


go before his advisor's thot and help it ou 
1920, these selections were 


atter summer these selections and several 
writer as materials to be presented to 12 
ized them. For those which he had 
tituted overlearning; as to the other 


selections, while there was no attempt to memorize them, a high 
degree of familiarity was gained during the process of reading 


them, perhaps a total of 100 times each, to other persons. 
In the summer of 1936, 21 years after the original learning, 


-_ 
yefore the American Psychological Association in 


Five years later, in the summer of 


relearned. During this 1 
others were used by the 
other persons who memor 
learned previously this cons 


! ! This paper was read | 
New York, September, 1957. 
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the writer again studied some of these selections. He relearned 8 
of the paragraphs originally learned in 1915 and relearned in 1920. 
He also learned 8 selections which were used by him in 1920 as 
materials to be learned by others—which had been, therefore, 
read again and again but not formally memorized. In addition 
he learned 4 wholly new selections, 2 by each method, from each 
of the authors named. 

When the results of the 1936 study were reported to the Amor- 
ican Association for the Advancement of Science, the student 
daily paper of the university where the writer was then employed 
commented upon them under the caption “Senescence No Bar To 
Learning.” Thus encouraged, the writer undertook some more 
work along this line in the summer of 1956. He relearned 8 of 
the selections which he had learned in 1915, relearned in 1920, and 
relearned again in 1936. He also relearned 4 paragraphs which 
had been learned in 1915, relearned in 1920, but not reviewed in 
1936; and 4 which had been learned in 1915, used as materials 
for others in 1920, but had not been relearned at that time nor 
reviewed since. And he learned 4 new passages. In each instance 
half of the selections were done by one method and half by the 
other. 

The 4 new selections learned in July 1956 were relearned in 
March 1957, a lapse of time of about 8 months. 

All of the selections relearned in 1920 employed the same 
method as that of the original learning. But in 1936 some were 
relearned by one method and some by the other. Altho learning 
was more rapid by the visual method, it appeared to make no 
difference in relearning time which method had been used first. 
No attention was given in 1956 to the initial mode of presentation. 

Each of the paragraphs learned in 1915 seemed definitely fa- 
miliar to the subject when relearned in 1936, and this was again 
true in 1956. However, some of them were felt to be much 
“warmer” than others. 

In 1915 and in 1920 the learning exercises were done in the 
daytime as a part of the regular day's work. The tasks in 1936 
and in 1956 were almost entirely done at night and after a full 
day's work at a busy summer session of the University of Ne- 
braska. The degree to which time of day and fatigue as well as 
lapse of time and age may have influenced results is not known. 

The writer had done almost no memoriter work in many years, 
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and he was especially interested to know if at his present age he 
could commit material by rote. There was self-assurance that he 
could, and he felt an eagerness to show that “He was the man 
he was.” Whether it was from overeagerness or from some other 
a definite feeling of tenseness during the study 
periods of 1956 and 1957 which cannot be recalled as having 
been present at the earlier times. There was an unwillingness to 
take a chanec—a hesitancy to try to repeat a selection tho it 
seemed to be learned—but rather an attitude of “I’d better go 
over it once more.” The very feeling of familiarity itself seemed 
at times to militate against good performance in the case of re- 
learning. The content of the reading was quickly apprehended 
but many slips made on details—synonyms were used, for ex- 
ample. It is, of course, to be remembered that the increase in age 
was the same as the lapse of time between learning exercises. 
The results of these studies are shown in the accompanying 
table. In terms of learning new material there seems to be an m- 
crease in time required with age. The writer doubts if time of day 
or fatigue had much, if any, effect. Even the somewhat jittery 
feeling experienced may itself be a characteristic of age. It may 
perhaps be noted that the learning times of 1956 were as low 
or lower than were the average times of those who acted as sub- 
jects of the experiment in 1920 whose ages at that time rs 
from 21 to 43 (there was one who was 58). Data for the origina 
studies are shown in references (1, 2). ; 
It is striking that, altho the visual method pe te: i 
superior to the auditory for this subject, the increase In ams 
time was greater by that method. A hypothesis cee en 
that reaction time, which commonly increases wit d a 67 
the activities of reading more than those of listening. age k 
the vi vas markedly superior to the auditory metho 
visual method was 


H . . 
The ius concerning retention show several erg d Eu 
Material relearned by the auditory method oe os st 
Savings over that originally learned by that = E M em 
long lapses of time and conresponding actually required slightly 
origi yisua j : 
cari PX b: sat cases, for relearning than for temian 
i the feeling of familiarity experience 
portant fact that 


On the other hand, one notes : 
as the selections were reread and also the very im 


cause, there was 
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TABLE 1 
LEARNING ABILITY AND RETENTION AFTER Lonc PERIODS 
| Average Learning Times? 
I j — New Materia 
Date | Age we % over ee pt 
A 4915 id 
1915 26 |21 A 12 |13.03 
V9 6.29 
1936 47 | 4A 214.9 14.4 
vV 2 7.25| 15.2 
1956 67|4A 216.05) 23.2 10.6 | 68.5 
v2 
Average Relearning Time |% 
EN" Savings —Á 
over 1915 
1920 31 21 A 12 | 7.24| 44.44 
v9 3.41) 45.08 
1936 47|SA 415.2] 60. Relearning of mate- 
V4 3.9 | 38. rial relearned in 
1920 
1936 4718A 4|5.4| 58.6 Material read many 
V4 3.9| 38. times to others but 
not memorized in 
1920 
1956 67|8A 44|8.9 | 31.7 Relearning of mate- 
Y d 6.75) —7.3 rial relearned in 
1920 and again re- 
learned in 1936 
1956 607|4A 2|7.22| 44.6 Relearned in 1920. 
v2 5.38) 14.5 Not reviewed 1936 
1956 67} 4A 2| 7.50) 42.6 
v2 6.9 | —9 Read to many others 
in 1920 but not 
formally relearned 
1956 16 A 8] 8.13) 37.7 
total V 8 6.57| —4.45 
Savings Savings 
over 1956 over 1956 
1957 68 | 4A 20. Relearning after 8 
v2 37.2 | 8.46, 20.3 onthe 
"RA purs auditory presentation, V — by visual presentation. 


» Time in minutes. 
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this material was relearned in about 38% less time than was re- 
quired to learn new material. It has been pointed out that the 
incrense in learning time in 1956 was greater for the visual than 
for the auditory method. It would appear, then, that the seemingly 
large loss between 1915 and 1956, especially following visual pres- 
entation, is a function more of age than of lapse of time per se. 


It may be said, too, that the original learning by the visual method 


Was accomplished at a considerably greater speed for this subject 
than that of others who engaged in the sarly experiment. 

In summary, the ability to memorize by rote meaningful prose 
Was by no means lost at age 67 so far as this writer was concerned, 
tho the rate of learning had deereased. The decrease was more for 
the visual method of presentation than for the auditory, tho the 
visual was still the more effective method. Retention of material 
learned and reviewed at earlier times was substantial fór that re- 
learned by the auditory method. The results for material relearned 
visually were not.so clear. There was probably a good deal of re- 
tention, but the factors associated with age seem to be more op- 
erative here than in the auditory mode. . 

It is to be hoped that others for whom early records are avail- 


able will make comparisons similar to those reported here. 
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THE RELATIONSHIP OF CERTAIN PERSON- 
ALITY MEASURES TO EXAMINATION 
PERFORMANCE UNDER STRESS' 


New A. Carrier 
Southern Illinois University 


One of the growing edges of recent psychology has been the 
study of the relationship of anxiety and behavior. Taylor (17), 
Taylor and Spence (19), Montague (10), and Lucas (6) have 
reported studies of the performance of Ss classified as anxious. 
Other investigators have tried to increase the anxiety of Ss by 
manipulating the stressful aspects of a testing situation and then 
measuring performance (3, 5, 12, 16). In general, these studies 
indicate that anxiety facilitates performance on simple tasks and 
inhibits performance on more complex activities, 

Perhaps more fruitful than the mere demonstration of a re- 
lationship between anxiety and performance are attempts to de- 
termine the extent to which certain personality characteristics 
mediate the anxiety-performance relationship. Deese, Lazarus, and 
Keenan (1) reported an interaction between Winne Neuroticism 
Seale scores and conditions of shock stress under which Ss learned 
nonsense syllables. Their findings failed to appear, however, when 
Lazarus, Deese, and Hamilton (4) repeated their procedures using 
a more complex learning task. High anxiety now apparently in- 
hibited rather than facilitated performance, While several other 
studies have tried to relate personality to performance under stress 
(e.g., 11, 18), the differences reported are generally not of large 
magnitude. There is a need for further attempts to relate other 
personality measures to behavior under Stress, 

Motivation for the present study develo 
terest in the work by MeKe 
Michigan on the effect of 


j ped in part from in- 
achie and others at the University of 
anxiety on the examination performance 
of college students (8). While it is generally believed that small 
amounts of anxiety facilitate many behaviors, an assumption in 


1 Adapted from a dissertation submitted to the Doctoral Program in Social 
Psychology, University of Michigan. Grateful acknowledgement is extended 
to Dr. W. J. McKeachie for skillful guidance du 


ting this research and for 
helpful reading of an early draft of this manuscript, 
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MeKoeachie's work was that most students come to a course ex- 
amination situation with so much anxiety about the outcome that 
any additional anxiety is likely to have an adverse effect on their 
performance. If this is true, it is possible to experimentally manip- 
ulate the stressful aspects of an examination situation and measure 
the corresponding effects on performance. This was done in the 
present study. The data were analyzed, however, according to the 
scores the Ss had obtained on four previously administered person- 
ality measures. 
Method 

The personality measures. The personality variables selected 
were four that seemed to have some promise of being related to 
classroom examination performance. They were the SA-S per- 
sonality inventory developed by Smith and reported elsewhere 
(13, 14, 15), and the need for achievement and need for affilia- 
tion measures developed by MeClelland and others (7). The SA 
dimension purports to be a measure of “permeability.” Persons 
scoring high in permeability tend to be sensitive to stimuli, im- 
pulsive, socially oriented, and low in ego strength. The S dimension 
("stability" at the high end and “anxiety” at the low end) is 
said to reflect, frec-floating anxiety, nervous tension, and desur- 
gency. The relevance of these measures to examination perform- 
ance under stress should be evident. If the highly permeable 


student is placed in a testing situation which carries a large 
y may produce & level of anxiety 


potential threat, his sensitivit i j 
which will cause a greater performance decrement than that o 
the relatively impermeable student. And when stress 18 introduced, 
the low stability student may show a greater performance de- 
crement than that displayed by the more stable student. 

The n Ach variable was used in view of the suggestion (7, pp. 


110-111 that in males it discrimi 
oriented A ae and failure-fearing (moderate ^ zu pon 
Sons. These differently oriented persons may differ s^ h p e 
kinds of attitudes they bring to à testing situation = in t 
abilities to handle the stressful aspects of it. The n Aff variable 
was used because of the implication that high n Aff persons are 


i : i iative ties than 
m " P b deprived of their affiliative ties 
ade more anxious by ih n Aff student were removed from 


low n Aff persons. If the hi e 
some of his fellow students, from his teacher, and from his usual 


nates between success- 
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classroom and required to take an important examination in a 
strange room with a strange "unaffable" teacher, he would ex- 
perience more anxiety than a comparable low n Aff student. 

The expected relationships of these personality variables to 
course examination performance under stress were formalized in 
a number of hypotheses: 

1. High SA students in a high stress situation will score lower 
than high SA students in a low stress situation; no differential 
response to stress will be shown by moderate or low SA students. 

2. Low $ students in a high stress situation will score lower than 
similar students in a low stress situation; no differential response 
to stress will be shown by moderate or high S students. 

3. Moderate n Ach male students in a high stress situation will 
score lower than similar males in a low stress situation; no dif- 
ferential response to stress will be shown by low or high n Ach 
males. (No prediction was made for female students.) 

4. High n Aff students in a high stress situation will seore lower 
than similar students in a low stress situation; no differential 
response to stress will be shown by moderate or low n Aff students. 

Because of the uncertainty of the validity of these relatively 
new personality measures, it was decided that the 
of differences between groups should be tested with 
test. 

Subjects. The Ss were students in Introductory General Psy- 
chology at the University of Michigan during the 
1955. A regression equation was devised to predict the scores of 
the x two Doe examinations. The weights for these indepedent 
fes Couns gegen Wore grade-point aver 
the first two course examinations The dui aru ie va es a 
id wechdllus quse aiea uia 1 weights for these independ- 

^ I ata on students in the same 
course during the previous semester, 

Vhen the predi ird examinati 
dy were ae t e qi ination score wor comput 
assigning students to a low or high E. Pc E PERE 
where they took their third examinati "à LC aoe a gii 

ation. This “matching” pro- 


cedure controlled for certain intellectual and motivational factors 
which influence performance. The sample consisted of 125 students 
in each of the experimental groups. The high 


significance 
a two-tailed 


spring semester, 


stress group con- 
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sisted of 43 male and 82 female students. The low stress 

contained 60 males and 65 females. In assigning Ss to oe 
groups, no thought was given to their sex. The resulting dis 
erepaney between the number of males in the high and low stress 
groups must be attributed to the perversity of chance. BE 
_ Stress situations. The operations by which the experimental 
situations were to be made differentially stressful were selected 
evance to the personality measures em- 
ployed. In the high stress group: the examination was given in a 
strange room: students were very closely proctored ju strange 
instructor; they were told there would be a guessing penalty in 
scoring the multiple-choice exam; two very difficult “practice” 
questions preceded the examination proper; and the instructor 
maintained a cold, abrupt, and rather severe demeanor through- 
out the examination. In the low stress group: the students took 
their examination in their usual classroom; their usual instructor 
was with them; they were told there would be no guessing penalty ; 
no formal instruetion sheet or practice questions were given to 
them; they used a special answer sheet which provided a space 
after each question for any comments they wished to make (8); 


and the instructor tried to maintain a warm, interested, relaxed 
ssion that he was sincerely interested 


ay he legitimately could. 


As each student finished his examination he answered a question 
lutehed" at any time during the 


whieh asked whether he had "c 
examination. This question Was ineluded to determine whether 
differences in felt anxiety would accompany the stress conditions. 


with an eye to their rel 


manner and to give the impre 
in assisting students to do well in every W 


Results 
p < 2) more clutching 


an by low stress stu- 
assumed differential 


significantly ( 
‘ess students th 
vort for the 
ntal situations. 

n into three groups represent- 
Test performance for each 


a tn d test showed 
dents oe by the high str 
stress his provided some SUPI 

S sfulness of the two experime 
SA: The SA dimension was broke 


ing hi ; s 
a high, mid, and low permeability. 
these groups of students was then analyzed under the two ex- 


Derimenta] conditions. High SA (permeable) males and females 
= were significantly affected by stress (Tables 1 and 2), whereas 
Id and low SA students were not. These findings support the 
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TABLE 1 
EXAMINATION Scores or SA Mates UxpEn Stress CONDITIONS 


Mean Predicted Mean mes " —— 
Stress SA N = Score z Score Relative “Gain 

Low Lo 17 .29 57 .28 

Mid 17 —.03 —.21 —.18 

Hi 20 = 20 19 ido" 
High Lo 11 .10 .44 .34 

Mid 20 .04 .12 .08 

Hi 12 .13 .00 —.1là* 


* Significance of difference between starred values: t = 2.26, p < 05 
(2 tails). 


" TABLE 2 
EXAMINATION Scores or SA FEMALES UNDER STRESS CONDITIONS 
M i Exam z pnis 
Stress SA N E Dreier Mes n ^ Relative “Gain” 
Low Lo 22 .26 .21 — 05 
Mid 21 +2 .05 —.07 
Hi 18 —.17 14 .31* 
High Lo 21 .28 26 —.02 
Mid 27 .08 —.16 —.24 
Hi 33 —.18 — .38 —.25* 
* Significance of difference between starred values: t = 2.43; p < 02 
(2 tails). 


hypothesis that stress is more detrimen 
formance of highly permeable student: 
erately or relatively impermeable st 


S: The hypothesis with regard to the Stability dimension re- 
ceived only tenuous support from the data. Low stability males 
displayed a performance difference under Stress in the predicted 
direction (Table 3). The t of 1.97, however, is not very reliable 
(10 > p > .05; 2 tails). No difference approaching an acceptable 
level of significance was found in the case of females. 

n Ach: The prediction with regard to need for achievement was 
unsupported. No differences in performance for high, mid, or low 
n Ach males under the two stress con 


; ditions were observed. Anal- 
ysis of the females’ data, however, showed a significant differential 


tal to the examination per- 
s than it is to that of mod- 
udents. 
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TABLE 3 
Examination Scores or S MALES UNDER Stress CONDITIONS 


Mean Predicted Mean Exam s 


Stress S N z Score Score Relative “Gain” 
Low Lo 19 —.03 -37 .40* 
Mid 17 —.01 .03 .04 
Hi 19 .08 15 .07 
High Lo 15 18 .10 —.03* 
Mid 15 .03 .37 .84 
Hi 13 .08 .01 —,07 


* Significance of difference between starred values: ¢ = 1.97; p < .10 
(2 tails). 


TABLE 4 
Examination Scores or n Acn FEMALES UNDER STRESS CONDITIONS 


Mean Predicted Mean Exam s 


Stress n Ach N s Score Score Relative “Gain” 
Low Lo 21 .07 94 .27* 
Mid 14 —.01 .12 .18 
Hi 20 .01 .15 .14 
High Lo 21 —.04 —.54 — .80* 
Mid 24 —.10 —.21 —.11 
Hi 21 .17 mut —.06 


I C BERE REPRE pe NU 
* Significance of difference between starred values: ( = 2.96; p < .01 
(2 tails), 


Teaction to stress by the low n Ach females (Table 4) such that 
their performance was detrimentally affected by stress. 

n Aff: The hypothesis with regard to the n Aff dimension was 
Dot supported by the data. Neither males nor females displayed 
Performance differences to stress related to their n Aff scores. 

SA-S types: Since the SA and S measures correlated only .12 
for males and .11 for females, another way of extracting meaning 
from them is to treat them as essentially noncorrelated dimensions 
defining four personality types—one at each of the four resulting 
quadrants (14). Median breaks were made on each dimension, 
and Ss scoring within one point of either median were discarded. 
The remaining Ss were classified as one of four SA-S types. 
Analysis of performance difference of these types showed no dif- 
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TABLE 5 
EXAMINATION Scores or SA-S Tyre FEMALES UNDER Stress CONDITIONS 


Ty P 
— o —— Mean Predicted Mean Exam z Relative 
Stress SA s N z Score Score “Gain 
Low hi hi 14 —.06 .25 ,31* 
lo hi 13 .si —.00 —.40 
lo lo 10 .12 .20 AT 
hi lo 9 —.25 —.30 —.08 
High hi hi 19 —.15 —.4l —.26* 
lo hi 9 43 .36 — 07 
lo lo HH .22 -25 038 
hi lo 19 —.18 —.49 -H 
* Significance of difference between starred values: ( = 2.04; p < -05 
(2 tails), 
TABLE 6 
EXAMINATION SCORES OF Acu-Arr Tyre Mates UxpEn Stress CoNpDITIONS 
Type 7 . | mum i 
E ar m P Mean Predicted Mean Exam s Relative 
Low hi hi 13 ~.09 248 —.0 
hi lo 9 .36 .44 .08 
lo lo 17 —.07 .97 AT 
lo hi 15 .10 E OL 
High hi hi 12 .08 .04 —.04 
hi lo 9 18 22 04 
lo lo 10 .00 .39 Boh 
lo hi 5 24 .35 Ad 


*t = 2.75; p < 02 (2 tails). und 
** t = 10; p < 15 (2 tails). 


ferential reaction to stress for males, H 
5) who were high SA and high § t 
depressed performance under stress, 


Ach-Aff types: A similar type analysis was made for the n Ach 
and n Aff measures which correlated only .08 and .10 for males a? 
females respectively. The males (Table 6) ahawed no differential 
reaction to stress, but an interesting observation was an appare? 
improvement of low n Ach-low n Aff males under both experi" 
mental conditions. The female Ss (Table 7) 
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ypes showed significantly 
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TABLE 7 
EXAMINATION Scores or Acir-Arr Typr FEMALES 
UNDER STRESS CONDITIONS 


- be Mean Predicted Mean Exam s 
Stress Ach Añ Pd z Score Score 

Low hi hi 16 —.12 —.01 M 
hi lo 10 .26 E .15 
lo. lo 12? .99 .82 .53* 
lo hi 17 —.15 —.07 .08 

High hi hi 14 04 —.05 —.09 
bhi lo H .21 .19 —.02 
lo lo 19 .07 — .32 — .25* 
lo hi — 19 —.10 —.40 —.30 


* Significance of difference between starred values: ¢ = 2.60;.p < .02 
(2 tails). 


Ach-low n Aff showed significantly depressed performance under 
stress. 


Discussion and Conclusions 


The belief that individual performance differences in response 
to stress during examinations are related to detectable person- 
ality variables has received some support from this study. One 
conclusion suggested by our analysis is that there may be a sex 
difference in response to stress. The greater frequency of sig- 
nificant findings for females is notable. Mead (9) has proposed 
that the role of the American female is social and dependent be- 
havior, not career accomplishment in competition with males. 
Field (2) and McClelland et al. (7) have reached a similar con- 
clusion. If we can regard many female college students as persons 
Who have not accepted academic achievement as one of their strong 
intrinsic motives, it may be possible to regard them as relatively 
Poorly equipped to cope with stress in achievement situations 
which they pereeive as only incidentally related to the gratifi- 
cation of their social motives. The implication here is that, if an 
activity is relatively unimportant or uncentral to one’s dominant 
Motives, one is poorly stimulated to develop those skills or de- 
fenses which keep the performance of this activity relatively un- 
affected by stress. 
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achievement situations is especially true of low n Ach females who 
may not have successfully resolved the conflict in favor of ac- 
cepting academic achievement as one of their strong motives. 

The n Aff variable has shown no independent discriminatory 
value in this study. The hypothesis with regard to n Aff is not 
supported. 

The Ach-Aff type analysis revealed no relationship of these 
types to examination performance under stress for males, The ap- 
parent improvement, regardless of stress on this test relative to 
two previous tests, of the low n Ach-low n Aff males may indicate 
a kind of student who tends to “put off” study effort until rather 
late in a semester when the threat of failure induces intense moti- 
vation to study. 

The low n Ach-low n Aff females did respond differentially under 
stress. This indicates that, although the n Aff variable has no pre- 
dictive value when used alone, when combined with the n Ach 

measure it does help indentify a type of female adversely affected 
= amination stress. 
Xwxssspenized that a student's performance on an exami- 
nined by many factors. Intelligence, amount of 
ject matter, and familiarity with test-taking 
nly aecepted as determining influences. Our 
Personality characteristics of the student and 
the testing situation must also be considered 


Summary 
- he relationship of four personality variables to student 
Performance on course examinations under experimentally ma- 
nipulated stress was investigated. The findings were that the 
Students most detrimentally affected by the stress conditions em- 
Ployed here were: highly permeable students, low stability males 
females with low achievement motivation, females of high Mg 
meability combined with high stability, and females of low achieve- 


Ment motivation combined with low affiliation motivation. Also 
: , 
Stress had a greater detrimental effect on females than on males. 
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The writer hesitantly raises the point that, if this latter hy- 
pothesis is true, it may mean that it is easier to produce stress 
effects in contrived laboratory experiments than it is in experi- 
mentally manipulated “real life" situations in which the Ss are 
highly involved and to which they bring well-practiced defenses. 

The SA (permeability) dimension has some promise as a means 
of differentiating individual performance response to stress. Since 
the conceptual meaning of this dimension includes sensitivity to 
influences from others, it is possible that the highly permeable 
student was most affected by the experimental manipulations 
which involved the behavior of the instructor during the exami- 


nation. However, since the design included a number of stress 
manipulations, specification of the s 
conjectural. 


The finding that the high SA-high S female students showed the 
greatest performance decrement under stress of all the SA-S types 
is somewhat contrary to what might have been expected. The high 
SA-low S type of student has been described as “ 
inative, dependent, disorganized, uncontrolled 
The high SA-high S student has such characteristics as sensi- 
tivity, changeability, sociability, impulsivity, and stability. Both 
of these types are high on SA and, hence might be expected to show 
performance decrement under stress, But the high SA-high S per- 
son is relatively stable, while the high SA-low S person is rela- 
tively anxious. It might have been expected, then, that the high 
SA-low S female students would have performed less well under 
stress than the high SA-high S female Students, The results indi- 
cate just the reverse. It may be that the high SA-low S female 
students were already so near maximal tension that the experi- 
menter was unable to reduce their tension in the low stress situa- 
tion and/or unable to increase it 
tion—and, hence, failed to get an 


pecific effects of any one is 


sensitive, imag- 
, and cycloid" (15). 


The n Ach variable has shown a 
Ach hypothesis with regard to 


f these students was the lowest 
of the types in both stress groups 
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achievement situations is especially true of low n Ach females who 
may not have successfully resolved the conflict in favor of ac- 
cepting academic achievement as one of their strong motives. 

The n Aff variable has shown no independent discriminatory 
value in this study. The hypothesis with regard to n Aff is not 
supported. 

The Ach-Aff type analysis revealed no relationship of these 
types to examination performance under stress for males, The ap- 
parent improvement, regardless of stress on this test relative to 
two previous tests, of the low n Ach-low n Aff males may indicate 
a kind of student who tends to “put off” study effort until rather 
late in a semester when the threat of failure induces intense moti- 
vation to study. 

The low n Ach-low n Aff females did respond differentially under 
Stress. This indicates that, although the n Aff variable has no pre- 
dietive value when used alone, when combined with the n Ach 
measure it does help indentify a type of female adversely affected 
by examination stress. 

It is recognized that a student's performance on an exami- 
nation is determined by many factors. Intelligence, amount of 
knowledge of the subject matter, and familiarity with test-taking 
Procedures are commonly accepted as determining influences. Our 
results indieate that personality characteristics of the student and 
the stressfulness of the testing situation must also be considered 


to determine performance. 


Summary 


The relationship of four personality variables to student 
Performance on course examinations under experimentally ma- 
nipulated stress was investigated. The findings were that the 
students most detrimentally affected by the stress conditions em- 
Ployed here were: highly permeable students, low stability males, 
females with low achievement motivation, females of high per- 
Meability combined with high stability, and females of low achieve- 
Ment motivation combined with low affiliation motivation. Also, 
Stress had a greater detrimental effect on females than on males. 
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PROBLEMS OF PROGENY RELATED TO 
PATERNAL EDUCATION 


Jamus Drascow 
University of Buffalo 


During the process of studying psychological breakdowns among 
college students (1), it was noticed that they did not present the 
often observed and more obvious symptoms of anxiety or worry 
that so often precede breakdowns at the adult level. The overt 
expression of anxiety and worry appeared to follow rather than 
precede the breakdown episodes. In every instance, the student's 
college career was prematurely closed. 

Another study (2) specifically investigated the premature clos- 
ing of college careers; one of the findings was that the academic 
mortality rate was significantly less for students who lived in 
residence halls than it was for those who lived at home. While last- 
ing longer and experiencing fewer breakdowns, the dorm students 
expressed more anxieties and worries on a simple “worry check 
list.” This latter finding reinforced the earlier observation that 
those who broke down did not show manifest symptoms of anxicty 
prior to collapse. These previously unrelated results would ap- 
pear to support the popular theoretical belief that people who are 
able to express their anxieties may be “draining off" whatever it 
might be in others that is supposed to stimulate a culminating 
collapse. 

The present article is aimed at one of the more measurable as- 
pects of the earlier researches. It had been found that not only 
did the residence hall students survive longer and express more 
Worries, but they also had fathers with more education. For ex- 
ample, the majority of the fathers of the dormitory students had 
been exposed to at least one year of college, while the majority 
of the fathers of the students who lived at home had not suc- 
ceeded in finishing high school; the difference was significant 
well beyond the .01 level.’ This finding fostered the problems of 
crossvalidation and determination of degree of association, if 


*No statistically significant relationships of discrimination or association 
Were found for maternal education. 
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any, between paternal education and number of worries professed 
by progeny. The present project focuses on these issues. 


Procedure 


The folders of 50 students were drawn sequentially from the 
university’s alphabetically ordered files. Extent of paternal edu- 
cation was obtained from Personnel Questionnaires that entering 
students completed; the number of worries was obtained from the 
questionnaire’s "worry check list" which is reproduced in Figure 


1. Each student's Minnesota Multiphasie Personality Inventory 
(MMPI) was classified as “stable” or “ 


unstable” according to 
the following criteria: stable profiles h 


ad all scales within normal 


Considerable 


7 i: Little or 
"Ait" Mole Noort 
t: Poor healthy... riora veces T - 
2. Fear of rats, mice or snakes... " m——— 
3. Unfair treatment by teachers... . —— 
4. Harmony in the home: 2c €"— 
5. Experiencing a sense of inferi- 
ONDI cues 63 ste nosset waa a eren, — — 
6. Being quite depressed when ig- 
MOOSE cues homeen Cerne — dices 
7. Preferring recreation alone... . one s 
8. Being POO A a sue ceu ee 
9. “Day-dreaming” while working. . € 
10. Occupational possibilities, ... . . ae 
11. Having my feelings easily hurt. . hae 
12. Finances at home... ..,, ae 
13. Difficulty in making friends... ee 
14. Never being asked to direct 

Cin gi RN —Ó 
15. Diffieulty in concentrating... ._ . ares 
16. Often depressed and in low Spirits eme: 
17. Slow to forget unpleasant experi- 

ONO i-o ce ae EDU nein sus dr. m—— 
18. No enthusiasm about life's possi- 

ENO per MEN nmm 
19. Worrying about little things... . — num 
20. Suspecting people of "under. 

handed” motives. ............., ee ae 
21. Family supervision and approval 

OF DONNI OT i ue crises sten os FETTE 


Fic. 1. Tue Worry Cuecx Lisr 


Personality Problems and Paternal Education 


g 
2 
[^ 


TABLE 1 
RELATIONSHIP BETWEEN FATHERS’ EDUCATION AND THEIR COLLEGE 
CHILDREN’S EXPRESSED WORRIES WHEN THE Two VARIABLES 
ARE CUT TO MAXIMIZE DISCRIMINATION 


Fathers’ Education 


College No College 
>3 12 13 
Worries 
33 3 22 


limits except Mf which is often elevated among college students, 
and F — K < —12 (3). All other profiles were regarded as unstable 
(4). 

Results 


A discriminant function analysis between the number of worries 
and the extent of paternal education resulted in an optimum 
maximizing cutoff point for discrimination as a function of worry 
at the three-worry level; the resulting 2 X 2 matrix is presented 
as Table 1. The cosine- pi tetrachoric correlation coefficient on 
this matrix is .66, and x? is significant beyond .01. With these 
quantitative results, it may be qualitatively stated that there is 
indeed a sizeable and significant relationship between the number 
of worries expressed by a student and the number of years of edu- 
cation his father has had: the more educated the father is, the 
more worries the progeny profess. An analysis of the variance of 
Worries among students resulted in a statistically insignificant 
F. This indicated that no one particular worry or set of worries 
listed in Figure 1 would serve to discriminate the children of par- 
ents with college from the children of parents without college. 

The finding of F > .05 for worries among students was cor- 
roborated by the results obtained with the MMPI. The MMPIs, 
when classified as stable or unstable according to the previously 
indicated criteria, showed no statistically significant relation- 
Ship to either number of worries indicated on the check list or to 
&mount of paternal education. 


Conclusion 


A formulation from all the foregoing is that the child of a college 
Parent may be more motivated for college than the child of a 
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groups in Brigham’s book is clearly contradictory to the experi- 
ence of the present writers. 


Method 
Subjects 


After the termination of the German-Polish war of 1939, a considerable 
part of the Polish forces, together with thousands of civilians, withdrew to 
the neighboring states of Hungary and Rumania. Subsequently, almost 
every member of this group was evacuated to France according to the pre- 
war alternate plan formulated by the Polish government. When, later in 
the war, France experienced the same fate as her Polish ally, nearly all 
these Polish people, military and civilian, escaped to the British Isles, Fol- 
lowing the Stalin-Sikorski Agreements of 1941, affecting over 1,000,000 Polish 
citizens previously deported to the Soviet Union, a second substantial 
group of Poles was brought to the United Kingdom. Again in 1944, after 


the Allied invasion of the continent, many Polish people who had been 
unwilling conscripts in the German 


^ E i army and labor battalions were re- 
united with the Polish national force 


, took up their station in England. 
had previously found refuge 

in the Near East. Thus in 1945, there were in Grent en bo 400,000 
and were represented, and this group 
was generally regarded as a fairly representative sample of the Polish nation. 
The first constructive approach to the field of applied psychology under 
Polish auspices was the Polish Military Psychodemographical Institute, 
organized in 1944, which had its central office in London, a branch office in 
Edinburgh, and several selection board centers distributed throughout the 
United Kingdom. Several officers, already Possessing extensive educational 
backgrounds, completed a specialized training program in psychometric 
theory and practice at Edinburgh University under the diventiont of the 
late Godfrey H. Thomson and the late James Drever, To sup plement this 
el of — Id personnel, a number of peyeholozia examiners 
were trained by the institute. In additi ; 'on- 
tinued their studies at the He Tur of the former a? 


t i t sities of Edinburgh and London. These, 
while engaged in their graduate Studies, were also carrying on research 


projects dealing with the measurement of intelligence apitudes, and per- 
sonality factors as exhibited by the Polish population i 


The Polish sample, tested by one of the writers (Wysocki) and 
his colleagues according to their military assignments, consisted 
of 15,267 men, including 651 officers. In addition, 666 adults (597 
men and 69 women) : 


) were tested by the same writer in the fol- 
lowing groups: 88 male students of 


à f a Polish university, 100 male 
graduates of Polish high schools, 232 male and 39 female high 
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school seniors, 105 men who had completed elementary school 
education, 36 Polish elementary school teachers, 36 Signal Corps 
soldiers, and 30 members of the Polish Women Navy Auxiliary 
Service (WAVES). 


Test 


Raven's Progressive Matrices (5) were administered to all Ss. 
The test has been widely used in England since 1938 and was 
considered to be particularly effective in this context because of 
its nonverbal character.* 


Raven's test consists of 60 matrices from each of which a part is missing. 
The Ss hus to choose the missing part to complete a pattern from six 
to eight alternatives drawn below each pattern. There are five sets of 12 
patterns each. The first series requires mainly ability in accuracy of dis- 
crimination, but later series become progressively more difficult requiring 
capacity to grasp analogies, capacity to form comparisons and other logical 
relations. A person's intellectual capacity is assessed on the basis of his 
total score. The test correlates highly with Terman-Merrill scale (r = .86) 
and has a high saturation (.82) with the Spearman's g factor. The test may 
be administered with or without a time limit, and it can be used either as 
an individual test or as a group test. 


The time limit of 20 minutes, as used by the British armed forces, 
British industrial institutions, and British schools, was imple- 
mented in all eases from which the following data were derived. 


Results and Diseussion 


The results obtained by the writer’s colleagues are reproduced 
in Table 1, and a few groups tested by the present writer 
(Wysocki) are specified in Table 2. 

As one would expect, Tables 1 and 2 show significant differences 
in test scores between officers and enlisted men, as well as among 
several selected groups of Ss. It is interesting to note that various 
groups of Polish nationals, as specified in Table 2, are ranked in 
about the same order as this test ranked similar English groups 
(14). If we look later at Tables 3 and 4, we also find (a) a 
similar hierarchical order of groups of Ss selected according to 


eo 

2 Coefficient of correlation with British Army S.P. 15 (verbal intel- 
ligence tests adapted by Wysocki for Polish population) was r = 697 + 
036 

* Groups 4, 6, 10, and 11 are also included in Table 1. 
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TABLE 1 
Raven’s Matrices (1038) MEAN Scores or PoLisH OFFICERS 
AND ENLISTED MEN 


Ss N 


Mean raw score SD SDM Mean IQ 
Officers 651 15.20 7.43 -29 122.7 
Enlisted men 14,616 32.50 10.39 09 103.0 


TABLE 2 
Raven’s MATRICES (1938) MEAN Scores or SEvEnAL GROUPS 
or Porrsu ApuLts 


Mean raw Mean stand- Mean 
Ss score ard score — IQ 


University students (88 men) 

High school graduates (100 men) 

High school seniors (39 girls) 

Military staff school students (87 officers) 
High school seniors (232 boys) 
Cadet-officers (564 men) 


1.85 — 130 
1.55 124 
1.28 120 
1.30 — 120 
naro 17 


97 116 
Elementary school graduates (105 men) = 115 
Elementary school teachers (36 men) .59 100 
Signal corps soldiers (36 men) 43 106 
Sailors (727 men) 30.71 12 102 
Infantry n.c. officers (598 men) 28.68 — 08 99 
WAVES (30 women) 26.00 — 29 95 


other than military criteria and (b) similarity with results ob- 
tained from the intelligence testing of English speaking Ss in Great 
Britain and in this country (5, 7, 9, 12, 13, 14). This shows 
that intelligence test scores of Polish Ss have distributions similar 
to those of English speaking populations and are quite comparable 
not only with corresponding English scores but also with those 
yielded by other intelligence tests in Great Britain and the United 
States for similar groups. 

The reader, however, may wonder at the relatively low IQ 
levels of the Polish elementary school teachers as shown in Table 
2. This may be explained by the fact that in prewar Poland 
elementary school teachers were trained in the five-year teachers’ 
training schools (Seminarium) after having completed a seven- 
year elementary school. Thus, their education was on a high school 
level and not, as in this country or in Great Britain, on a college 
level. The standard of the Polish elementary school teachers’ 


) 


z 
is 
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To THE LEVELS or EDUCATION 


Raven's 
Percent- (1938) 
Grade age of mean raw Sta 
School completed S sample score score IQ 
Elementary schools 1 4.00 26.57 —.28 95 
3 9.50 26.92 —.25 96 
41 and 5 24.20 E —.11 98 
6 and 7 30.40 29.22 —.02 100 
Lower technical and com- 10 17.50 35.00 .53 108 
mercial schools 
Secondary schools 11 1.885 12.90 38.94 -91 115 
Academie schools 12 219 1.50 42.55 1.26 120 


and above 


education was much lower than that of those students who grad- 
uated from the Polish secondary schools, called Gymnasium, 
preparing their students for subsequent university studies. Table 
2 also shows that the IQs of the army higher rank officers are lower 
than the IQs of high school (Gymnasium) graduates. This lends 
support to the present writers’ contention that the selection to 
the higher officer ranks in the Polish army was not always carried 
out on an objective basis. . 

The test results obtained from enlisted men were subjected to 
percentage analysis with regard to several characteristics of the 
Polish Ss tested, Table 3 shows the distribution of IQ points with 
regard to educational levels. Here, again, one may observe a 
hierarchical distribution of IQs of Ss according to levels of educa- 
tion, from elementary school at the bottom to universities at the 
top. Again, an English intelligence test (Raven’s matrices) ranked 
the Polish Ss in about the same way as it did English speaking 
Ss (5, 8, 9). 

Table 4 shows the distribution of IQs with regard to occupa- 
tional backgrounds of Polish adults. The differences between 
urban and rural intelligence in this country and elsewhere were 
widely discussed in the last two decades (2, 3, 6, 10). Here, in 
Table 4, the marked differences in mean scores between agricul- 
tural and industrial groups seem to confirm the influence of urban 
and rural environment in the Polish population also. Table 4 also 
Shows significant variations of intelligence with regard to dif- 
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TABLE 4 


DISTRIBUTION oF IQs or Potts ExLrsTED MEN ACCORDING 
TO OCCUPATIONS 


Raven's 
(1938) Mean 
Percentage mean raw standard Mean 
Occupation N of sample score score IQ 
Domestic services 44 .30 26.76  —.27 95 
Agriculture 9,281 63.50 27.97 —.15 97 
Industry 2,052 20.20 32.51 .29 105 
Transportation 234 1.60 33.06 34 — 106 
Public services 219 1.50 34.25 AG 108 
Commerce 59 40 38.22 .75 112 
Other occupations 1,708 12.30 40.03 1.07 116 
Schools 29 .90 — 42.03 1.2) 119 
TABLE 5 


Ravens MATRICES (1938) MEAN Scores or Encuisu (9) anp Ponisi 
ADULTS ACCORDING TO AGE Lev 


2 P 3; 5 Chronological ane in years d 
English* 5,857 44 44 42 40 38 35 
Polish 3,247 31 31 29 27 25 24 


* No time limit for English adults (9, p. 15). 


ferent occupational groups—again, similar 
with the same test and the same time limit by British authors 
(12, 13, 14). Also, the age groups show marked differences in 
their responses to Raven’s intelligence tes 


Rh i t. The decline in test 
scores in our sample, as shown in Table 5, may be observed already 


to the results obtained 


asured by Raven’s Pro- 
ups may also be noted 
(9, 12). 

tion show that scores on 
ith caution. The test score 


onal groups. There are other 
Cloeconomie status, and emo- 
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TABLE 6 
Raven's Marrices (1938) MEAN Scores or ENG Lisi AND 
Poist MEMBERS OF THE ARMED FORCES 


Ss English Polish D i SDp t 

Officers N 643 651 
M 46.00 45.20 .80 .37 2.18 
SD 5.70 7.43 
SDs .22 .29 

Enlisted men N 15,000 14,616 
M 34.00 32.50 1.50 .11 14.18 
SD 7.70 10.39 
SDs .06 .09 


tional reactions which intelligence tests do not measure, but which 
may affect intelligence test scores to a marked degree. One of 
the present writers has received from the British War Office Selec- 
tion Center (11) data on Raven’s Progressive Matrices (1938) 
obtained from the testing of English officers and enlisted men. 
The test was administered to English adults at the same time our 
data from the Poles were being collected, and the same procedure 
of the test administration was applied. Thus, the English data 
Were quite comparable to the data reproduced in the present in- 
vestigation. Table 6 shows the critical ratio between the English 
and Polish officers (t = 2.18) statistically insignificant, and the 
critical ratio between the English and Polish enlisted men (t = 
14.15) significant at the .01 level in favor of the former. Standard 
deviations of both groups indicate, however, that the groups of 
English Ss were more uniform than the groups of Polish Ss. In 
other words, the Polish samples were characterized by the wider 
Tange of intelligence levels which increased the critical ratio. 
Let us now compare the English and Polish samples in Brig- 
ham’s study of 411 and 383 Ss respectively (1, pp. 124, 126, 142). 
Mean combined score for the English was 14.87 and for the Poles 
10.74, and the critical ratio 33.4 in favor of the former. In our 
investigation, there was no significant difference between the of- 
ficers and not so large a difference between enlisted men of both 
national groups. In the majority of cases, the Polish soldiers in- 
cluded in the present study encountered psychological tests for 
the first time in their lives, whereas the English soldiers were 
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already familiar with testing while attending schools. Jones, diz- 
cussing the same problem with regard to the American Ss writes: 
5... allowance for town familiarity with tests would give a dif- 
ference of town and country of five points, which is in agreement 
with the eventual conclusion on the American results” (6, p. 5). 

Fowler in his study (4) on 10-year-old pupils from 14 ele- 
mentary schools of Detroit and Hamtramck, Michigan, has found 
no significant differences in mental test results of the Polish and 
white American nonethnie groups. Whereas in the Brooklyn, New 
York, arca intelligence tests results of Polish children are con- 
sistently lower than those of American nonethnic children. These 
results point out the cffect of socioeconomic status on scores ol 
conventional intelligence tests. One has to recognize psychometric 
limitations of intelligence tests unless we really can develop *eul- 


ture-free” tests—free from cultural learnings and unaffeeted by 
socioeconomic differences. 


Summary and Conclusions 


Available data concerning the level of Polish intelligence (1) 
based on a small and nonrepresentative group of Polish immi- 
grants in the United States were felt by the writers to be in- 
adequate. The aim of this paper was to produce recent and valid 
data on the intelligence of Poles. The Polish sample consisted 
of almost 16,000 adults. As all classes and all parts of Poland 
were included, this sample is the most representative and also the 
most recent one of adult Polish intelligence, 

Raven’s Progressive Matrices (1938) (9) were used, and the 
test results were subjected to percentage analysis according tO 
different military assignments, education 
cupational backgrounds. It is quite 
made test, when administered to various groups of Polish na- 
tionals, not only ranked various Polish groups in about the same 
order as this test ranked similar English groups (5, 7, 8, 9. 11, 
12, 13, 14), but also assigned scores numerically quite compar 
able to corresponding English scores and inde 
able to those yielded by other inte 
and the United States for com 


al levels, ages, and Ed 
a striking fact that the British- 


ed quite compat- 
lligence tests in Great Britain 

parable groups. The results re- 
produced in this study show clearly marked differences in intel- 
ligenee levels (as measured by intelligence tests) of different 
groups caused by different factors such as education, occupation 
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rural and industrial environment, and age levels similar to what 
we may observe in intelligence testing of English-speaking Ss. 

Further, the results obtained from the testing of members of 
the Polish armed forces were compared with the results obtained 
by the use of the same test on the members of English units of 
the British Army (11, 14). It appeared that the differences in 
mean scores between Polish and English officers were statistically 
negligible and the differences between English and Polish enlisted 
men, although statistically significant, were markedly smaller 
than the differences between Brigham's groups of English and 
Polish immigrants. This lends support to the writers’ contention 
that, the more homogenous the groups of Ss we compare, the 
smaller are the differences in intelligence levels we measure (4). 
Until really “culture-free” intelligence tests are developed—free 
of education, culture, and socioeconomic status influerees—any 
comparisons of national groups based solely on intelligenee tests 
scores should be treated with caution. 
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PREDICTING INTERESTS FROM DIFFERENT 
KINDS OF CUES! 


Marvin TAYLOR 
Queens College, New York 


There is little doubt that effectiveness in group relations is re- 
lated to the ability of the individual to perceive accurately the 
interests, intentions, and belicfs of other group members. Gage 
and his co-workers have reported positive correlations between 
such variables (11, 12, 13). Chowdrey and Newcomb found that 
leaders and nonleaders in various kinds of groups significantly 
differ in the accuracy with which they predict relevant opinions 
of group, members (5). Indeed, Steiner states that one proposition 
underlying the rationale of teacher education is that “the more 
knowledge an individual has concerning the intentions, prefer- 
ences, and beliefs of other persons, the more effectively he can 
participate in group activities with those other persons” (15). 

Another area of research that has had a direct effect on our 
knowledge of social perception has been the concern with the 
characteristics of good and poor judges of others (1, 3, 8, 10). 
A third area of research endeavor concerned with social pereeption 
has been the attempt to find out more about the determinants of 
accuracy and inaccuracy of social perception (4, 10, 11, 17). 
Gronlund points out that “in the classroom, teachers are expected 
to judge the knowledge, attitudes, interests, and a host of other 
characteristics about their pupils. Although it is easier to obtain 
evidence in some of these areas than in others, a major share of 
teachers’ responses are determined in part by the accuracy ° 
their perception...” (14). Further evidence reveals that many 
teachers are highly inaccurate in their perceptions of students 
abilities and aspirations (7). 

Examination of the previous research suggests that one of the 
variables which has not been studied in any systematic way is 
the effect of different kinds of cues employed in social perception- 
For example, when a student attends a class for the first time, 
the teacher may perceive his motoric pattern, style of clothing; 


1 Research project completed as part of doctoral dissertation under the 
supervision of Francis J. Divesta, Syracuse University. 
534 


Predicting Interests from Cues 535 


facial expressions, cleanliness, manners, verbal comments, etc., or 
some combination of these cues. It is probably safe to argue that 
social perception is mediated by both holistic and such differenti- 
ated patterns of perception. Assuming that the preceding is true, 
it is probable that the accuracy of an initial social perception of 
an individual is influenced by the mode and content of the cues 
available at that time. It is also probable that increments in the 
accuracy of social pereeption are influenced both by the appear- 
ance of additional cues and their order of availability. If the type 
of initial cues available, the appearance of additional cues, and 
their order of presentation could be controlled, it would seem 
possible to gather some valuable data about the social perception 
process. 

Assuming that the variables mentioned above can be controlled, 
the folowing null hypotheses are testable: s 

1. There is no significant difference between a judge’s accuracy 
in predicting interests of a stranger resulting from exposure to 
different types of introductory cues. 

2. Successive increments of information about a stranger do 
not increase a judge’s accuracy of prediction of a stranger's 
interests. . 

3. Final predictions of a stranger’s interests are not different 
from earlier ones as a result of differences in the order of pre- 
sentation of cues. 


Method 


Subjects. One hundred and eight teacher trainees (hereafter 
called judges) were randomly divided into six experimental sec- 
tions. The subjects were predominately female; there were 18 in 
cach section. 

Experimental situation. Ten men known to be outside the peer 
group of the judges, but of approximately the same age group, 
were tested on the Minnesota Teacher Attitude Inventory (6) 
and the California F Scale (2). Of these ten, two, whose test scores 
were consistent but with marked differences between them (i.e., 
one may be called an authoritarian and the other a nonauthor- 
itarian), were requested to perform certain activities (described 
below) before an 8-mm. movie camera and were tape recorded 
in an individual interview session with the experimenter. Neither 
of these men was informed of the activities they were expected 
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TABLE 2 
F Tests oF SIGNIFICANCE BETWEEN CUE Conpition MEANS AT 
Eacu Prepicrion PorNT 


" Stranger 1 Stranger 2 
Predictions F df $ P df ? 
One 8.44 2/105 .01 3.47 2/105 03 
Two 4.81 2/105 .01 2.43 2/105 = 
Three 1.98 2/105 — 2.41 2/105 = 


means on the first predictions for Stranger 1 
less than the .01 level of confidence. The mean differences relevant 
to Stranger 2 are significant at the .03 level of confidence. The 
corresponding F ratios are 8.44 and 3.47 with 2 and 105 degrees 
of freedom (see Table 2). Differences in mean accuracy of second 
predictions for Stranger 1 are also found to be significant at the 
-01 level of confidence. On the other hand, the mean differences in 
accuracy of second and third predictions for Stranger 2 and third 
predictions for Stranger 1 are not significantly different. 

The significant Fs reported above were further analyzed by 
means of ¢ tests; these results are reported in Table 3. The ¢ tests 
were undertaken to provide a more rigorous analysis of the differ- 
ential effects that each of the cue conditions might have had upon 
the accuracy of the first and second predictions. The results indi- 
cate that the judges using verbal cues for their initial predictions of 
the interests of both strangers are significantly more accurate than 


are significant at 


TABLE 3 
Comparison or Raw Accuracy Means AT F 


IRST AND SECOND 
PREDICTION Points 


, Stranger 1 Stranger 2 
Cue Sequences 1 b ? 
First Prediction 
Habitual-facial Li — 2.22 .05 
Habitual-verbal TTA .01 1.44 = 
Facial-verbal 6.55 01 3.76 01 
Second predictions 


Habitual, facial-verbal, habitual 
Habitual, facial-facial, verbal 
Facial, verbal-verbal, habitual 


rm" 
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the judges utilizing the facial-hand cue condition. The correspond- 
ing fs are 6.66 and 3.76. Furthermore, the judges’ predictions for 
Stranger 1 are significantly more accurate when they used verbal 
cues than when using habitual cues (t = 7.74). On the other hand, 
when predieting for Stranger 2, no signifieant differences in ac- 
curacy are found between judges using verbal or habitual cues. 
Table 3 further reveals that no other significant differences in 
accuracy are found when judges predicted a second and a third 
time for Stranger 2. On the other hand, when predicting for 
Stranger 1 a second time, the judges utilizing facial-verbal cues 
are significantly more accurate (at the .01 level of confidence) 
than the judges predicting from exposure to habitual-facial or 
verbal-habitual cues. The second predictions of judges using 
verbal-habitual eues are signifieantly more accurate (at the .05 
level of confidence) than the predictions of judges utilizing habit- 
ual-facial cues. * 

By employing a 3 X 3 factorial design, it was also possible to 
explore the effects of increments of information on predictive 
accuracy from one prediction to another and the effects of the 
different sequences of cues on the accuracy of the third prediction. 
The results indicate that there are significant differences between 
cach prediction, and there is also a significant interaction between 
the predictions and cue conditions. These relationships are found 
for both strangers. The differences in accuracy when using one set 
of cue sequences as opposed to one of the other cue sequences re- 
veals contradictory results. An F ratio of 3.28 is significant at the 
.05 level of confidence when predicting for Stranger 2, whereas a 
nonsignificant F ratio is reported when predicting for Stranger jm 
There is no readily apparent reason why there should be differ- 
ences in cue sequences for one stranger and not the other. 

The finding of a significant interaction for both strangers and 
an examination of the accuracy means (see Table 1) seems to 
suport the hypothesis that verbal cues are more instrumental in 
affecting the accuracy of predictions than either of the other two 
cue conditions. ; E 

Finally, the differences between successive predietions were 
further analyzed by means of ¢ tests. Table 5 indicates that the 
differences in accuracy of prediction for Stranger 1 are significant 
at the .01 level of confidence between first and second and first 
and third predictions. The differences in accuracy between second 
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TABLE 4 
F Tests or SIGNIFICANCE BErWw THREE PREDICTIONS 
AND THREE Cre SEQUENCES 


Stranger 2 
Source of Variation 


D F df ? 
Between Predictions» 30.65 2/210 .001 9.63 2/210 ..001 
Interaction: Predictions 12.61 4/210 .001 2.93 4/210 .03 
X Cues 
Between Cue sequences 2.02 2/100 — 3.28 2/105 .05 


^ Correlation between predictions Was treated as 
wards (7). 


suggested by Ed- 


TABLE 5 


COMPARISON or MEANS For acu Pre 


DICTION 


Stranger 1 Stranger 2 
Predictions is df P ay (P 
One-Two 4.52 107 .01 2.78 107 .01 
One-Three 6.04 107 -001 3.40 107 .01 
Two-Three 2.12 107 05 59 107 — 


^ttest is based on correlated means. 


and third predictions 
In predicting for Stranger 2, the diffe 
predictions are not Significant, The 
that increasing increm 
effect upon increments 
that a law of diminish 


are significant at the .05 leve] of confidence. 
rences in accuracy between 
means and the ¢ tests indicate 
ents of information do not have a uniform 
in the accuracy of Predictions, It may be 
Ing returns is in operation, 
Summary 

One hundred and eight teacher trainee: 
of two strangers after receiving infor 
quences, about each person. The ye 
first null hypothesis does not hold, 
expressive movements and even his facial expressions when frus- 
trated, although frequently depended 9^, are inadequate criteria 
for personality judgments as compared with listening to him speak 
about his feelings and aspirations. The study weakens but does 
not destroy the other two hypotheses tested, 


S predicted the interests 
nation, in different se- 
sults obtained show that the 
Observing & person's habitual- 


l; 


to 


16. 


I7. 
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SPELLING DISABILITY AND ASYNTAXIA IN 
A CASE INVOLVING INJURY TO THE 
LANGUAGE FORMULATION AREA 
OF THE BRAIN'? 


Jack A. Horwrs 
University of California 
AND WinLIAM HYMAN 
VA Hospital, Long Beach 


In his 1955 presidential address to the APA Division of Edu- 
cational Fsychology, Guy T. Buswell said: “The ultimate ob- 
jective, of course, is not merely to relate [psychological] research to 
currently accepted educational theory, but rather, to do the kind 
of research [in educational psychology] that will help to formulate 
an educational theory that is valid” (1, p. 178). 

In order that learning theory in the field of educational psy- 
chology may itself be valid, however, it must be grounded in the 
neuroanatomy, physiology, and biochemistry of the organism 
that does the learning, i.e., the human being. To this end, Hames 
(2), has stressed the importance of association pathways in lan- 
guage disabilities, and Holmes (6) has worked out the relation- 


ships between the neurophysiology of the brain and the reading 
process. In the present paper, the writers intend to show that 
when, under certain circumstances, neuroanatomical evidence is 
combined with appropriate Psychoeducational behavior, some 
areas of common ground may emerge, 

This being the objective, the reader is asked to consider first 

*Delivered before the California Educational Research Association’s 
Annual Spring Conference, March 1957, at Santa Barbara, California. 

"This work was done under a VA Research Fellowship at the Long 
Beach Veterans Hospital while the Senior author was on sabbatical leave 
from the University of California, Berkeley, The writers express their 
gratitude to the VA for the use of their facilities and Permission to publish 
case material. Likewise, the writers are indebted to J. M. Nielsen, Senior 
Neurological Consultant, not only for bringing this case to their attention, 
but also for his helpful suggestions, 
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the neuroclinieal evidence, then the psychoeducational evidence, 
and finally the integration of the two. 
Case History 

R. J., a 35-year old, right-handed, white, male truck driver was 
hit with a bottle on the left side of his head July 14, 1956. At the 
receiving hospital he showed right-sided weakness, increased deep 
tendon reflexes on the right side, and a right Babinski sign. Angio- 
grams, an electroencephalogram, roentgenograms, and a spinal 
fluid examination were reported as not remarkable, and the pa- 
tient was transferred to the Veterans Administration Hospital in 
Long Beach. An EEG taken at this hospital was suggestive of a 
left-sided extra-cerebral mass—thought to be a hematoma. Be- 
fore further action was taken, however, on August 26, 1956, the 
patient signed himself out of the hospital against medical advice. 

The patient was readmitted to the VA Hospital on January 2, 
1957. In the interval since his last admission, five months before, 
he had been bothered by transient occipital headaches, lacrimation, 
and diplopia. His speech had become impaired, and he complained 
of numbness on the right side of his face, tinnitus and pain in his 
right ear, 


Hospital Course 


A pneumoencephalogram then showed the left lateral ventricle 
of the brain to be moderately dilated. The neurological picture 
Was consistent with unilateral cerebral atrophy on the left. Tre- 
phination was done bilaterally in the temporoparictal regions. Vis- 
ual examination through the “burr holes” did not reveal the 
expected subdural hematoma. 

Shortly after these diagnostic procedures had been completed, 
the patient began to show a gradual clinical improvement. How- 
ever, two defects persisted. He had difficulty in arranging his 
spoken words in proper order, and he was unable to write spon- 
taneously. 


Neuroclinical Examination 


The cerebral localization of the patient’s present aphasia is 
indicated by the following clinical examination: 


1. The patient is able to recognize both animate and inanimate objects; 
therefore, he does not have visual agnosia for objects, and we assume areas 
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17 and 18 to be intact. Figure 1 depicts the various 
cortex referred to during this discussion. Sinee R 4 imis 
by revisualization, objects which he describes in color and de tal it appe: 
that we are justified in also assuming area 19 to be intact (7, 8). 

2. The patient is able to understand s] 
shows no auditory-verba] agnosia, 
mann’s areas 41 & 42) to be intact (7, 8). 

3. The patient is able to identify 
his eyes are closed ; therefore, he 
the hand sensory area in the 
Superior parietal. 

4. The patient is able to c. 
by the command, 


reas dn or on the 


eas also able to recall. 


oken commands: therefore, he 
and we assume Wernicke's area (Brond- 


objects placed in his hand even though 
manifests no tactile agnosia, and we assume 


post-central gyrus to be Intact, as is also the 


ürry out such 
"Light and smoke aci 
he does not have ideational or ideokinet 
does not have a diffuse cortic 


a sequence of acts as is demanded 
garette.” This would indicate that 
le apraxia, and we assume that he 
al involvement or a deep lesion in the supri- 
marginal gyrus (area 40) on the left side. Likewise, since he d 
to express his thoughts, he does not have cortical motor-pattern apraxia of 
the mouth,:and we assume Broca's aren (area 44) to be intact, 

5. The patient's verbal responses manifest 
lation. When asked, “What kind of work wouk 
"I want to drive for a truck.” When Inquiring 
again, he said, “If ean work, how long 
regarding his previous employment, 
Janitor], and I was working truck, 
with syntax and it cannot be 


is able verbally 


difficulties in language formu- 
l you like to do? 
as fo the 
and for work? 
he said, 


AS he unswered, 
possibility of working 
" In response to a query 
^I was clean man [meaning 
too." He, therefore, has obvious trouble 

med that aren 37 on the major side is intaet. 
Drodmann's area 37 is located, as Figure | shows, the postero-lateral 
portion of the temporal lobe and, as first Pointed out by T Ree ie 
concerned with the formulation of language and with dintin. Die anbi, 
on the major side, coordinates word-symbol impulses from various por- 
tions of the brain and organizes them into the syntactienl nein or iin 


coherent sentence. An injury to this language formulation arca would result, 
ae k culty with Syntax, grammar, rhetoric, or 


itient's having diffi 
Style (7, 8). 


7. The patient is able to add and subtract mentally, 
numbers and solve problems on paper; therefore, he she 
we assume his lesion has not affected the border of 
or the gyri of Gratiolet (7, 8). 

8. R. J., as mentioned above, experiences no difficulty 
Likewise, he can Copy, using the right hand, letters, words sentences, and 
drawings: therefore, he does not have agraphia for these, 1 
that Exner's writing center with its corresponding 
are intact (7, 8). 


He can write the 
shows no ucaleulia, and 
the interparietal suleus 


in writing numbers. 


ünd we ean assume 

motor area for the hand 

It is somewhat misleading to Speak of a « 

“area” for any linguistic function 35 We have done for actually 
1 a 


any such function is so complicated that it takes many parts of 
the brain working together to execute such a verbal act, demon- 


center” or specific 
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fissure of Rolando 


Medulla 


Fic. 1. BropMaxn’s Areas Inpicatep BY Number (See Text ror FUNCTIONAL 
INTERPRETATION ) . 
strate a linguistie skill, or express a symbolic thought. Such a 
"eenter" as Exner's writing center, of course, could not write by 
itself: on the other hand, without it, the rest of the brain could 
not write either. 

In contrast to the patient's ability to write numbers, he ex- 
ulty in trying to write the alphabet 
and finds it entirely impossible to write words either singly or in 
a sentence. As will be shown later, this is true whether the task 
is initiated from an oral command, a written command, or from 
a self-initiated desire to express himself. From these findings, it 
is clear that the patient does have agraphia for linguistic sym- 
bols. However, since the patient can write numbers and ean copy 
ssumed that the basic difficulty rests in his inability 
(asyntaxia), and if this is correct, it would 
elsen's theory (7, 8), a lesion 


periences a great deal of diffie 


words, it is : 
to formulate language 
be because he has, according to Ni 
in area 37 on the major side. 


Psychocducational Examination 
to increase our understanding of the relationship of 


ation to psychocdueational phenomena, the fol- 
esting was done: 


In order 
cerebral localiz I 
lowing intensive differential t 
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1. On a Dot-Embedded Configuration Test (3), which ASSESSES the pe! 
to see a dotted outline of a symbol when it is “hidden” in n breton o 
dots, R. J. did relatively well, scoring 78 correct out of 108 items tried: 

2. On a Dot-Line Configuration Test (3), which 


assesses the ability to 
revisualize mentally and com 


plete a symbol when only minimal cues are 
given, the patient did very poorly, scoring 30 correct out of 108 items tried. 
This tells us the patient has trouble with “closure” 
the “closing” information from memory, 

3.Ona Concept Formation Test (9), which assesses the 
from 10-item-sets subgroups of four words that 
they are all associated with the 
fairly well, scoring 29 correct, 11 

4. On a Simple Language Fi 
subject/s ability to select fr 


when he has to supply 


ability to select 
"belong" together because 
same concept or idea, the patient did only 
Wrong, and omitting 8. 

ormulation Test (9), which assesses the 
om a choice of four possible words the best 


one to fill a blank left in a sentence, the patient did relatively well, scoring 
43 correct, out of 60 items, N 


ote that in this test recognition rather than 
recall is the important requirement, 
5. Ona Matching 


select paired homon; ery well in the simple part of the 
test, scoring 24 out of 30; but he did very poor 


i ly in the complex section, 
scoring only 9 out of 30 items tried, 


subject. visually to 


tence. On this form, R 

Form II of the Sy; 
requires the subject 
member the order, a 


ntax Test presents the same scrambled sentences but 
to reformulate the w 


: Ord order in his own mind, re- 
1 nd write the best Sentences as the response for each 
item. The patient did very poorly on this form, scoring only 7 correct 
out of 25 items tried. By the use of a fixed hand mirror, the patient's eye 
movements were observed to make many excursions fiom stimulus to re- 
sponse during the writing of each letter. In other words, not only the 
memory of the word order of the Sentence, but also the memory for the 
letter order of the word did not appear to be functioning for the purpose 
of writing; therefore, one is forced to question thig patient's ability to 
spell. 


the sentences in their original mixed-up order, but he simply could not 
straighten them out! 


* The standardized norms are not applicable here, because it was felt 
that more information could be gai 1 


e gained by using the instruments as power 
rather than as speed tests. The patient was Cooperative in all tests. 
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From the three forms of the Syntax Test, it becomes quite evident that 
the real trouble is in language formulation and perhaps letter formulation 
or spelling from recall, and the difficulty is not with the writing per se. 

7. To this end, then, 2 High School Spelling Test (3), consisting of 35 
words of increasing difficulty, was administered to the patient in four different 
forms: 

Form I of the Spelling Test simply required the subject to recognize 
and encircle the one word in four that was correctly spelled. On this 
mutiple-choice version, the patient did surprisingly well, scoring 20 correct 
responses out of 35 items. Table 1 presents, in the first four columns, the 
choices for the five easiest items and the patient's responses to them. It 
should be noted that they are all correctly encircled. 

Form II of the Spelling Test required the subject to write these same 
words from dictation. Column 5, Table 1 shows the patient’s best efforts 
on the first five easiest items. His total score was “0.” 

Form III of the Spelling Test required the subject to “peck out” the 
letters on the typewriter in order to spell these same words from dic- 
tation. It seemed reasonable to believe that if the patient had the letters 
before him he might be able to select them in the proper order to fit the 
formulation of at least those 20 words which Form I indieated he knew 
how to spell correctly. However, even this proved to be an impossible 
task for him—his score was “0.” Column 6, Table 1 gives the responses 
made by R. J. on the same five items. 

Form IV of the Spelling Test required the subject to spell the words 
aloud from dictation. In spite of the fact that writing was in no way 
involved, R. J. was not able to spell a single word correctly. Column 
7, Table 1 gives the verbal responses made by the patient to these same five 
words. 

8. The patient did very poorly on a Phonetie Association Test (3), scoring 
10 out of 50 items attempted. 


Discussion 


There is a case reported in 1953 which in many ways is strik- 
ingly parallel to the one being reported here. In other ways, how- 
ever, there are marked differences. 

Trowbridge (10) opens his article with the interesting state- 
ment: “The syndrome of loss of ability to write with preserved 
ability to read, is almost unknown in the field of aphasie disorders. 
It is the reverse of pure word blindness (inability of the patient to 
read what he himself has written)." 

Trowbridge’s patient was a 27-year old, white, navy man who 
had suffered a cerebral thrombosis involving branches of the left 
posterior cerebral artery. Six months later, the only aphasie re- 
siduals “persisting, were disorder in spelling and writing." There 
was no anomia, auditory-verbal, nor visual-verbal agnosia; there 
was no alexia, acaleulia, finger agnosia, confusion of laterality, or 
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TABLE 1 


R. J.’s Responses on Five Easiest IrEMS or tHe HiGH Scnoon 
SPELLING Test UNDER Four DIFFERENT CONDITIONS 


: " Recall from dictation 
Recognition from visual inspection 


(Multiple-choice Spelling Test) 


Writing Typing Oral 


1 2 3 4 5 6 7 


ee | — 


1. brav brve brev Larr berk da 


2. dinner dinr ines) denr Aeron diinir dinnird 
3 (foney) hony huny honny | Ju hucy hogny 


4. pont pount gynt anam | tite poyn 
5. theaf theif thefe || clea! cobc thef 


^ Typed by examiner from patient’s oral responses. 
' 


confusion of body scheme, He was able spontancously to formulate 
speech conveying very complex thoughts. Unlike the present pa- 
tient, Trowbridge's patient had no difficulty with either language 
or word formulation, However, both patients are similar in that, 
while they can copy words, neither ean write them, 

Trowbridge's patient could spell “automatically,” but not when 
he “directed his thoughts toward spelling complex words.” In 
this patient, the neurological examination indicated that the lesion 
was in the angular gyrus (area 39) on the major side. An unusual 
situation developed when the patient was asked to try to write 
with his left hand. While he insisted it was impossible, to every- 
one’s surprise he did 80 very well! 
tioning, that the patient as 
learned to read before 
changed to right-hande 


It was revealed, upon ques- 
a child had been left-handed. He 
entering school. In school he was forcibly 


dness for writing. All his life he used the 
left hand for handling tools, but he had always written with his 


right hand. A left angular £yrus lesion then “produced an isolated 
loss of writing ability in the right hand, while the corresponding 
area on the right continued to perform normally for reading and 
other functions governed by this area, Writing with the left hand 
was possible as a result of the normal amount of skill present in 
this hand, together with the high level of functional development 
present in what would have been ordinarily the dominant angular 


gyrus” (10). R. J., on the other hand, was not able to write 
words with either hand. 
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Summary 


A ease report is presented in which the clinical evidence indi- 
cates that the patient had originally suffered a traumatic intra- 
cerebral hemorrhage, whieh had absorbed, with resulting impair- 
ment of area 37 on the major side. The relationships of area 37 to 
the other pertinent areas of the cortex concerned with aphasia are 
discussed and the possibility of their involvement eliminated or 
indicated. The psyehoeducational significance, resulting from dam- 
age to the language formulation area (Brodmann’s area 37) on the 

is revealed by intensive differential testing. 

i Psychoeducationally, the most interesting finding is the 
evidence which indicates that the writing from dictation type of 
spelling ability and the proofreader's type of spelling ability are 
not one and the same ability. The significance of this finding takes 
on added weight when it is recalled that Holmes (5), in 1954, 
e samples of high school and college students 
ating that these two 


major side, 


working with larg 
was able to present statistical evidence indie 
while highly related, were in faet not 
utilizing some of the 
able to show that the 


on type of spelling 


types of spelling ability, 
the same. By a substrata-factor ana 
same tests used on this ease, Holmes was 
subskills underlying the writing from dietati 
ability depended more upon auditory images, while the proof- 
reader's type of spelling ability depended more upon visual images. 

2. One hesitates to draw major conclusions from a single case. 
However, the evidence in the present study seems to indicate 
that spelling ability as rev valed by the ability orally to spell or 
to write words correctly from dictation calls upon à language 
formulation area (presumably area 37), while the proofreader's 
Lype of spelling ability to detect whether or not 2 word is spelled 
correctly by visual inspection does not require the use of this 
area, This being the case, then: 

3. The most significant theoretical implication that can be drawn 
udy and the substrata-factor analysis of spelling 
area of spelling, an empirical relationship 
substrata factor theory (4, 5) 


from this case study 
ability is that, in the 
has been established betw 
alization (7, 8). l 
4. The evidence indicates that agraphia (amnesie agraphia) 
asie inability to formulate the letters of 
actually knows when they are 


cen the 
and cerebral loc 
can be caused by a b 


words, even though the patient 
spelled correctly. 
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5. The term asyntazia is suggested as a word describing the 
linguistic loss resulting from injury to the language formulation 
area (area 37) of the brain. 
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Roperr W. Warre. Abnormal Personality. New York: Ronald 
Press. 1956. Pp. 64. 

How to present the rich materials of abnormal psychology in a 
humanized manner, intelligible to all who can read and are in- 
terested, is the challenge the author set up for himself in the first 
edition published in 1948 and has again adopted as his task in 
edition published in 1956. His aim, he says, is to present 
the subject in such a way as to emphasize the usefulness to all 
students of human nature, and it is because of this desire that 
he sees fit to include two introductory chapters, one historical 
and one clinical. The chief difference in the two editions is the 
reduction in content devoted to consideration of dreams and 
hypnotism and expansion of material concerned with more recent 
experimental studies and what their findings implicate. The 
author presents similar material in a more systematic manner 
with increased attention to developmental stages and an attempt 
to take an explicitly neo-Freudian position, balancing his earlier 
reliance on Horney with ideas drawn from Fromm and Sullivan. 
The case studies used include a maladjusted adolescent person, a 
neurotic, a delinquent, a psychotic, and a person with a progressive 
brain disease; and this material is used by him to advantage m à 
consideration of development and integration of personality. 
How can we find out how the difficulties of life can make one 
person maladj usted, another neurotic, another delinquent, another 
psychotic and how another passes through comparable difficulties 
without troubles? To bring order, system, and understanding 
into facts as diverse as the illustrative cases is the purpose of the 


whole book. . : . . 
The author shows 2 familiarity with the literature of all topics 
covered and takes an eclectic attitude toward all conflicting issues. 
He considers the problem of therapy fairly thoroughly for an 
hotherapy notas a hopelessly complex 


elementary text, and sees psy ) ] | 

theme, but one that has to do with removing blocks in the learning 
, 

process and promoting new growth where the processes are partly 


understood and wholly understandable. Therapies he pays a 
good deal of attention to are client centered, psychoanalysis, 
and briefer psychoanalysis. Of nondirective therapy, he says it 
works on a client in just one way and neglects the possibility of 
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using other approaches in a wise and timely fashion. He gives it 
as his impression that for persons with a serious neurosis, measures 
most patiently worked out in Freudian psychoanalysis are likely 
to be more promising in the long run. The basie processes con- 
sidered by him as essential in all psychotherapy include (a) initial 
therapeutic relationship, (b) expression of feelings, (e) pointing 
out of feelings, (d) transference, and (e) behavior resulting from 
treatment. 

There are chapters on the various mental disorders and chapters 
on development which are well done. The last chapter deals with 
the problem for society and represents a compact consideration of 
the meaning and implications for mankind and society of mental 
disease and health problems. His general conclusion is that the 
preventing of abnormal behavior is one in which every citizen can 
participate. 

All in-all, Dr. White has set up for himself a relatively difficult 
task and to a large extent has fulfilled it. To the reviewer, it is 
like a modernized version of Morg: 


zan's earlier efforts when less 
was known as given by him in Psychology of Abnormal People— 


the emphasis is on people and the knowledge given is definitely 
for use in understanding people. The title A bnormal Personality 
is somewhat of a misnomer since jt is conce 
with an emphasis on diversity, rather than 
Such a thing as the abnormal 
purposes only. A more hone. 
normal Personalities. 


rned more with people 
the thesis that there is 
personality. But that is for caption 
st title might be Understanding Ab- 


, H. MELTZER 
Human Relations Research Foundation, St. Louis, Missouri 


J. P. Guinrorn. Psychometric Methods (2nd. Ed.). New York: Mc- 
Graw-Hill, 1954. Pp, 597. 

When the first edition of this book appeared in 1936 specialists 
in educational and psychological measurement were jüst beginning 
to court mathematics and theory seriously. Thurstone headed 
the newly formed Psychometrie Society, Wows journal, Psy- 
chometrika, began that year, Kuder’s kindred journal Educational 
and Psychological Measurement, did not start until five years later. 

Guilford’s comprehensive new edition reflects changes during 
the eighteen-year interval and contains a great deal more material 
than its predecessor. New chapters on the theory of psychological 
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measurement and psychophysical theory build a logical basis 
for psychophysical and psychologieal sealing methods. Early 
in the book Guilford devotes 43 pages to explaining simple mathe- 
maties and curve fitting. He warns, however, that “it was neces- 
sary to eliminate most of the statistical treatments given in the 
first edition.” One must go elsewhere for these, he says, preferably 
to his Fundamental Statistics in Psychology and Education. ` 

In this second edition Guilford attempts to account for a great 
many phenomena of human judgment by means of Helson’s 
concept of adaptation level. The new chapters on theories of 
psychological tests, reliability and validity of measures, and test 
development serve as vehicles for presenting recent advances. 
Also, much updating occurs in the chapters whose titles were 
altered relatively little. Finally, appendixes now include tables 
showing significant values of r, R, t, and F and incorporate other 

i 


modifications. 

Among the newcomers to Guilford’s pages are Cronbach, 
Gulliksen, and S. S. Stevens, who have become prominent since 
1936. We find little reference, however, to contemporary mathe- 
matical statisticians (none to Rao, for example). Latent structure 
analysis seems not to be mentioned. Coombs’s recent theory and 
techniques get only slight notice. These and similar points are 
consonant with Guillord's explicit disavowal of ultra-modernity: 
“this volume represents the general field of psychological 


measurement. as of about 1950” (p. v). But one must stop some- 


where! 

The procedures th 
upon large-sample theory, nor! du 
moment correlation, mirroring certain limitations of statistical 
knowledge during the carly days of psychology when psycho- 
physical and psychological sealing methods were being developed. 
Professional workers and graduate students in education and 
psychology are indebted to Guilford for the time and effort he 
spent revising this textbook-handbook-manual. They will find 
the chapters on pair(ed) comparisons, rating scales, and factor 


ly valuable!. 


at Guilford explains lean rather heavily 
nality, and Pearson product- 


analysis particular 
JuuiaAN C. STANLEY 


University of Wisconsin 


1 Prospective users should read 
tre 
Clemens Johnson, and Bert Green, 


1 the excellent reviews by John Keats, 
respectively, in Educ. psychol. Measmt. 
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Ernest R. HILGARD. Theories of Learning: (2nd Ed.). New York, 
Appleton-Century-Crofts, 1956. Pp. 563 

This edition represents a thorough revision of the widely read 
1948 book. Four new chapters have been added dealing with (a) 
Freud’s contributions to the psychology of learning, (b) the use of 
mathematical models, (c) reinforcement and drive, and (d) “some 
persisting issues.” These new chapters account largely for the 
approximately 140 added pages, although the 1948 chapter on 
Wheeler's contribution has been dropped. The book carries 60 
pages of bibliography, and almost half of the titles did not appear 
in the 1948 edition. 

Hilgard’s general orientation, as he developed his chapters, is 
much the same as it was in the earlier book. His exposition is 
built “around the psychologists who Bave currency to the several 
theories”—Thorndike, Guthrie, Skinner, Hull, Tolman, Wert- 
heimer,' Koffka, Köhler, Lewin, Freud, Carr and Woodworth. 
Most of the chapters begin with a brief and somewhat contempo- 
raneous history of the development, of the “position,” which is 
followed by an analysis of the major experiments, a critique of the 
essence of the point of view, and an estimate of its bearing upon 
questions such as: What are the limits of learning? What is the 
role of practice in learning? How important are drives and in- 

and punishments? What is the place of under- 


something else? Wh; s ember 
forget? Each chapter i ith an excellent bibliography. 

Hilgard writes lucidly and his approach is generally objective, 
chorough, and critical. j 


h He Judges Hull’s work to be the best 
etfort at "hypothetico-deducti 


ive system-making” to appear in the 
first half of the cent Mo bir sas against 
too much emphasis i odels are suggested, and 
the likelihood that « ich material from psychoanaly- 
sis has eluded the academi ists” (p. 307) is both la- 
mented and explained, i 
Theories of Learning is written for an American audience 
and is probably best adapted to graduate students who are taking 


(1955, 15, 215-218), J. Amer. stat. 


Assn (1956, 51, 
(1955, 20, 163-165). 


413-414), and Psychometrika 
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their first course devoted exclusively to the psychology of learning. 
Hilgard’s personal position is eclectic and he makes a successful 
effort to indicate that there is something to be gained from each 
and all of the theoretical positions he develops. None of the sys- 
tems, in his judgment, is invulnerable to criticism and the “con- 
struction of a fully satisfactory theory of learning is likely to re- 
main for a long time an uncompleted task.” 

For educational psychologists who are primarily interested in 
finding out more about learning that can be applied in the teaching- 
learning situation, Iilgard's book does not have a great deal to 
offer. This is true even though he concludes with a discussion of 
“the relation between learning and the practical demands of 
instruction and training” (pp. 485-90). The contention is made 
that there are a number of important propositions about learning 
on which a majority of theorists would agree, and which do have 
clear though rather general implications for teachers. These propo- 
sitions are not intended to be revolutionary. Some of them, para- 
phrased a bit, go like this: 1. In deciding who should learn what, 
the capacities of a learner are very important. 2. Motivation 
that is too intense may be accompanied by distracting emotional 
states. 3. Learning under the control of reward is usually preferable 
to learning under the control of punishment. 4. Learning under 
intrinsie motivation is preferable to learning under extrinsic 
motivation. 5. Tolerance for failure is best taught by providing 


ss that compensates for experienced failure. 


a background of succe: i 
verlearning 


6. There is no substitute for repeated practice in the o 
of skills. 7. Transfer to new tasks will be better if in learning 
the learner can discover relationships for himself (pp. 486f). 
In connection with his discussion of practical implications, 
Hilgard is, in this reviewer’s judgment, unusually fair and moderate 
in his conception of the so-called conflict between pure and applied 
research. He neither believes that applicational research can wait 
until there is a pure science, nor that the existence of the pure 
applications (p. 489). “When 


science will automatically result in l 
the relationships between pure and applied research are properly 


worked out, the requisite skills are of a high order. However, 
the labor is divided, and there is no reason why prestige should 
not be more equitably distributed” (p. 490). This, from so dis- 
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tinguished a source, gives some support to those who are disposed 
to believe that one of the best ways for the teacher to learn about 


learning is for him to experiment with his own teaching. 


STEPHEN M. Corny 
Teachers College, Columbia University 


Paur K. Crosser, The Nihilism of John De wey. New York: 
Philosophical Library, 1955, 

The most interesting, as well as the most comprehensible, 
part of this book will be found in the first nine paragraphs of the 
preface. In this passage the author presents a bill of particulars 
with respect. to the outrages which John Dewey has perpetrated 
on the civilization of the West. 

It is made manifest, that an obsession which pursued Dewey 
throughout his life was nothing less than the destruction of 
philosophy. How near he came to achieving this insane ambition 
can be judged by the condition in which Mr. Crosser now finds us. 
The American intellectual is adrift, the American artist's outlook 
is fragmentary, and the American teacher is “enmeshed in un- 
related details.” And if this Were not enough, he reports also 
that America’s perspective has been destroyed and her influence 
in the world diminished accordingly, 

It would be a dull fellow indeed who could 


over how such havoc was Wrought single-handed by as mild and 
amiable a soul as John Dewey is reputed to have been, It is likely, 
therefore, that many readers wil] turn With anticipation from 
the preface to the body of the text. Those who are not in the habit 
of reading prefaces will be in on the secret, from the beginning— 
that this is a monument to dullness. At least the preface reader 
can enjoy for a few moments his illusion that at long last. the 
anatomy of destruction is to be laid bare, 
: The node of ite pages of it js devoted to shred- 
ing large chunks o three of Dewe s tition deala. ic. The 
Theory of Inquiry, Art as Experience, sb Lt be i-i 
tion. The technique of what its author optimisticall calls “this 
critical analysis” is to cite a Sentence or phrase ien Desar and 
then envelop it in the heaviest sarcasm to he found outside a 
sophomore bull session. 


à à 

s | sessio Sarcasm, Particularly when it is prolonged 
r 232 pages, is a " SIE 1 
Or pages, is diffieult for even a good writer to m 


suppress his curiosity 


anage—and 
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in truth Mr. Crosser is not to be numbered among these. Sarcasm, 
used sparingly and with finesse, can be deadly to an opponent. 
But here it serves only to obscure the argument and to create an 
atmosphere of petty. conten tiousness, which would be distasteful 
even to one who was sympathetic to the argument itself. 

The analysis is directed towards certain aspects of Dewey's 
theory of knowledge, his theory of art, and his theory of education. 
The author maintains that in all of these Dewey’s nihilism is 
The chapter titles are indicative of the tenor of the 


apparent. 
“The Degradation of Knowledge," “The Dissipation 


exposition: 
of the Sense of Beauty," and “The Exclusion of Schools.” 

The blunt instrument which was used in the initial crime against 
the West was the kind of "eognitive indeterminableness" which 
is alleged to be the root of Dewey's villainy. Through it the natural 
and social sciences have been struck down. Subject and object, 
cause and effect disappear into a cognitive limbo. The inevitable 
result is extreme relativism. (Crosser always puts this term in 
italics lest we forget that it is a bad thing.) 

The rest of. catastrophe follows hard on this. Objec 
darkness, followed quickly by the artist. 
finition of estheties 


ts of art 


are consigned to outer 
The sense of beauty is lost and the very de 
isma of extreme relativism. 


becomes obscured in a mi 
l education the situation 15, 


With respect to the state of forma 
if anything, worse. The price we have paid for our cognitive un- 
school as an institution is fast dis- 
appearing. Our concept of education as formal learning of organized 
knowledge is all but forgotten. In our extreme relativism we are 
rn what is educative experience and what is 
not. The decline of the West is all but accomplished. 

It is too early to discern how telling this blow will be against 
the philosophy of nihilism, but it seems likely that Dewey's 
idy been diminished in one respect. It has 
i ! quarters that Dewey had the worst 
TONS (lis vas (rue (and 
env it), it is hardly true now. Dewey hàs 
i f this thesis the following passage 


determinableness is that the 


even unable to disec 


reputation has alre: l 
long boon alvuvaled Ham 
prose style to be fount 
there are many who d 
met his mateh, and in support 9 
from page 119 is offered in evidence: 

“The cognitive violence which the removal of att objects às 
art objects as an analytical factor from the kind of art theory e 
iS about to evolve is of such serious nature that the return of 


558 Book Reviews 


the art object into the fold of inquiry, after it had been left out 
of consideration in the evolvement of the basic principles of that 
inquiry, cannot restore to that kind of inquiry the character of an 
inquiry into the arts.” 


G. Max Winco 
University of Michigan 


Wituram E. Henry, The Analysis of Fantasy, New York: John 
Wiley & Sons, Inc., London: Chapman & Hall, Limited. 
1956, Pp. 305. 

The Analysis of Fantasy is a consideration of the use of thematic 
apperception technique in the analysis of personality. The title 
is smarter as is, and to some extent can be said to be justified 
since the thematic apperception is one of the relatively effective 
methods of getting at fantasy for the sake of interpreting symbolic 
behavior. The material is presented in three parts. It is concerned 
with, respectively, the process of interpretation, illustrative ana- 
lyses, and description of the pictures, There is an appendix which 
includes six pages of notes on the administration of the test and a 
very comprehensive and specifically classified bibliography of 
the thematie apperception technique, which takes up nearly 
30 pages. Though well written, it does assume interest and per- 
sistence on the part of the reader as well as some familiarity with 
the background and use of the thematic apperception tests for 
the study of personality as a whole as well as fantasy. It is a fact, 
as the author says, that the interpretive significance of the themas 
perceived and stories told are derived from the understanding 
of the ways in which the emotional Preoccupations are symbolized 
in words and plots, from the y ays in which emotions bow to 


V. 
propriety in overt expression, or ways in which each individual 
lessens his own burden by expressing his own feelin s through 
disguising them for himself and others. To the E 
where fantasy rules, logie and pr 

through the analysis of 


: irectly į 
is considered at some length. Y 1n the use of the test 


Variables of form and content are treated in separate chapters. 
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Content refers to what the subject said and form to how he said it. 
And both categories are set in a conceptual framework for in- 
dividual case analysis wherein comparisons are made with Rors- 
chach formulations. The cases presented include a record of a 
healthy, middle class boy, a 29-year-old man, a Dakota Sioux of 
14, and a Navahogirl of 11. In the chapter on the stimuli properties 
of the pictures, each card is described under six captions: Murray’s 
Description; Manifest Stimulus Demand; Form Demand; Latent 
Stimulus Demand; Frequent Plots and Significant Variations. 

This is a highly worthwhile volume for all people interested in 
the use of projective technique in the study of personality and for 
the thematic appereeption tests so used particularly. Investigators 
of any phase of problems related to the validation or the use of the 
thematic apperception technique will find the bibliography very 
helpful. 

H. MELTZER 


Human Relations Research Foundation, St. Louis, Missouri 


Orro PoLLAK. Integrating Sociological and Psychoanalytic Concepts: 
An Exploration in Child Psychotherapy, New York: Russell 
Sage Foundation, 1956. Pp. 284. 

Effective application of social science concepts and approaches 
can make a significant contribution towards the improvement of 
diagnosis and therapy in child guidance work. This is the author’s 
thesis, and the material used and the interpretations made are 
all slanted in the direction of proving this thesis. To a fair extent, 

the whole story is the report of Dr. Pollak’s experience with a 

child guidance team. A clinical psychologist, because of illness, 


was not included in the team. f 

In this volume, the author raises a number of interesting prob- 
lems that are relevant in considering ways of improving child 
guidance efforts and results. The much to-do about the neglect 
of the father in child guidance work is not new but is televant: 
The concepts he advocates the use of, which in the author’s 
way of thinking represents a reformulation of method, are four 
in number: family orientation, social interaction, social role, and 


cult flict. s 
ure con: ume is presented in four parts. Part I, 


The content of the vol 1 L ; 
which is called “Family Pathology, Family Diagnosis and Family 
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Therapy with Child Welfare Focus,” is illustrated with the pres- 
entation of two case studies. Part IT, called “The Therapeutic 
Management of Extrafamilial Factors’ is illustrated with three 
ease studies. Part III, called “A Research Outlook” covers psy- 
chiatric literature and differences between theory and practice. 
Part IV is called “The Agency Looks at the Project” and is an 
evaluative comment by Herschel Alt, Executive Director of 
Jewish Board of Guardians, which, with the Russell Sage Founda- 
tion, made possible this project. 

Some of the concepts stressed by Pollak are worthy of con- 
sideration. Some are over-generalizations. For example, when 
he says that a child welfare focus would result in consideration 


of situations rather than individuals so that we would not have 
sick husbands or s 
or sic 


1 k wives, but sick marriages; not sick children 
k mothers or sick fathers, but sick families; not sick teachers 
or sick students, but sick school situations; not immature, sick 
guardians or sick administrators, but sick institutions, he is calling 
attention to the 


advisability of emphasizing emotional climate 
of groups rather than emotional climate of individuals; but how 
much reality is he running away from, as well as getting into, 
by so doing? It may be a fact that the “bad child” so-called, 
referred to by the mother as such, is best conceived as being in 
= made “a by the mother and all things in the form of 
uman surroundings at home. It doesn’ 7 rever, that 
in all instances there are no such e mh Eee ii 
some institutions do permit, almost invite sickness in the be- 
havior of the people who live in them. It also doesn’t follow that 
even in such instances that some people who come there have 
been made sick before they came there. The universal implication 
of the author’s emphasis would be to have an institution to house 
intitas rather than individuals. To use a word which appeared 
in the “Talk of the Town” in a recent issue of the New Yorker, 
the author has done too much “jumboizing” 
by so doing has run off the track reality, as w ivi 
some realities deserving ei nano el a made vive 
On the assumption that things well done sh 
vival value rather than þe reformulated or 


are lige ee omissions here. Included aro: No mention of the 

work 2: "ri ; loft to 1 i 
ep. I "d or Fritz Redl; no attention to the organization of 
arias and attitudes towards the use of them; no consideration 
) E ERU 


of his concepts and 


ould be given sur- 
thrown out, there 


f 
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of economic factors. A greater awareness of the work realities of 
the clinical team and a lesser emphasis on verbal reformulations 
would more likely yield a more relevant and enriched exploration 
in child psychotherapy. 
H. MELTZER 
Human Relation Research Foundation, St. Louis, Missouri 


Frank A. Locas, Davin L. OrwsrEp, Burton R. Rosner, 
Riewarp D. Scuwarrz, & Cant M. Srevens. Behavior 
Theory and Social Science, New Haven: Yale University 


Press, 1955. 

A group of Ford fellows in behavior science with a three-year 
appointment at the Institute of Human Relations of Yale Uni- 
versity have produced this volume which is an attempt to restate 
the Hullian point of view with regard to behavior theory and to 
make applications of this theory to problems in social science. 
The five authors of this volume, according to their own statement, 
met as a group five days a week throughout the first year for 
ess frequently during the second and 
They also acknowledge 
agues at Yale 


several hours a day and | 
third years in which the report was written. 
btedness to a large number of other colle: 


their inde 
could discuss Hull’s point of view and its applica- 


with whom they 
tions. 

It is indeed significant that this is one of 
wo years to present an extended analysis 
There has been a kind of race to see which 
of the theories propounded by Guthrie, Hull, Skinner, and Tolman 
would have the pre-eminence. It is not certain by any means 
that the issue has been finally settled and history will probably 
record that cach of these psychologists has made his enduring 
contribution. Certainly at the present time, however, Hull's 
theoretical structure intrigues the largest number of psychologists 
and there has been greater elaboration of his theories by other 
psychologists than of any of the other current learning and be- 


havior theories. 
One chapter in the 
of the respons; and 


Stimulus and other anti 
Language," the writers go beyont 


a number of books 


to appear in the past t 
of Hull's point of view. 


book is devoted to discussing the properties 


another ehapter is given to diseussion of the 
; seat conditions. In chapter ñ, entitled 
| the basit principles undet- 
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lying Hull’s point of view to a discussion of thinking and problem 
solving. Chapter 6, entitled “Free Behavior," deals with the same 
phenomena that Skinner discusses under the title of operant 
behavior; that is, behavior which is in response to stimuli not 
immediately present. These authors prefer to call this free be- 
havior, and they make a strenuous effort to fit it into the Hullian 
framework. Since the results are somewhat speculative, they have 
drawn up three models of such behavior and then state that 
these should be the subject for much experimentation in order 
to determine which of the three models comes closest to fitting 
actual experience. The last two chapters in the book deal with 
social interaction and cultural behavior. Here there js still more 
speculation, but always attempting to derive an explanation of 
social behavior from the basic Hullian proposition. 

This reviewer can recommend this book to serious students of 
learning and behavior theory as a discussion whic 


h moves our 
theoretical formulation a step ahead. 


PERCIVAL M. Symonps 
Teachers College, Columbia University 
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